Showing 528 open source projects for "python data analysis"

View related business solutions
  • Securing the Cloud Made Easy Icon
    Securing the Cloud Made Easy

    Multi-cloud security delivered — now and in the future.

    Designed for organizations operating in the cloud who need complete, centralized visibility of their entire cloud estate and want more time and resources dedicated to remediating the actual risks that matter, Orca Security is an agentless cloud Security Platform that provides security teams with 100% coverage their entire cloud environment.
    Learn More
  • Native Teams: Payments and Employment for International Teams Icon
    Native Teams: Payments and Employment for International Teams

    Expand Your Global Team in 85+ Countries

    With Native Teams’ Employer of Record (EOR) service, you can compliantly hire in 85+ countries without setting up a legal entity. From dedicated employee support and localised benefits to tax optimisation, we help you build a global team that feels truly cared for.
    Learn More
  • 1
    mk1utils

    mk1utils

    mk1utils: tools for the Wersi MK1, EX20, DX10, EX10R, DX5

    mk1utils: tools for the Wersi MK1, EX20, DX10, EX10R, DX5 Included tools: * mk1parse: parse SYX and ROM/RAM files of Wersi MK1, EX20, DX10, EX10R, DX5 * mk1reqall: create SYX and MIDI files for requesting voices from Wersi MK1, EX20, DX10, EX10R * romcvp_dx10_to_mk1: convert CVs+Presets ROM/RAM from Wersi DX10 to MK1 format Special features: Decoding of envelope microcode (modul, special, and standard), rhythms, and sequences. Export of rhythms to MIDI files and waves to WAV files....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Bots open source edi translator

    Bots open source edi translator

    Bots is a complete translator for edi: edifact, x12, xml, tradacoms

    Bots is a complete translator for edi (Electronic Data Interchange). EDI data formats eg: edifact, x12, tradacoms, xml. Mail: http://groups.google.com/group/botsmail Web-site: http://bots.sourceforge.net Wiki: http://bots.readthedocs.io Develop: https://github.com/eppye-bots/bots
    Downloads: 14 This Week
    Last Update:
    See Project
  • 3
    scriptLattes is an open-source system to create academic reports of groups based on curricula vitae of the Brazilian Lattes Database.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    Python ADB

    Python ADB

    Python ADB + Fastboot implementation

    ...Under the hood it speaks the ADB protocol directly and can connect via USB or over TCP, which is useful for lab setups and headless servers. Because it’s Python, you can compose device actions with your favorite testing, scraping, or data-collection libraries in one process. The project also includes utilities for robust connection handling and timeouts so flaky USB links don’t derail long runs. It’s well-suited to CI test farms, large-scale telemetry, and custom device control workflows.
    Downloads: 8 This Week
    Last Update:
    See Project
  • PeerGFS PEER Software - File Sharing and Collaboration Icon
    PeerGFS PEER Software - File Sharing and Collaboration

    One Solution to Simplify File Management and Orchestration Across Edge, Data Center, and Cloud Storage

    PeerGFS is a software-only solution developed to solve file management/file replication challenges in multi-site, multi-platform, and hybrid multi-cloud environments.
    Learn More
  • 5
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 50 This Week
    Last Update:
    See Project
  • 6
    unfluff

    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    cquery

    cquery

    C/C++ language server supporting multi-million line code base

    C/C++ language server supporting multi-million line code base, powered by libclang. Emacs, Vim, VSCode, and others with language server protocol support. Cross-references, completion, diagnostics, semantic highlighting, and more. cquery is a highly-scalable, low-latency language server for C/C++/Objective-C. It is tested and designed for large codebases like Chromium. cquery provides accurate and fast semantic analysis without interrupting workflow. cquery implements almost the entire...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    AerinSistemas-Noname

    Elasticsearch to Pandas dataframe or CSV

    API and command line utility, written in Python, for querying Elasticsearch exporting result as documents into a CSV file. The search can be done using logical operators or ranges, in combination or alone. The output can be limited to the desired attributes. Also ToT can insert the querying to a Pandas Dataframe or/and save its in a HDF5 container (under development).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    pyspider

    pyspider

    A powerful Spider(Web Crawler) system in Python

    pyspider is a powerful Spider(Web Crawler) system in Python. Components are connected by message queue. Every component, including message queue, is running in their own process/thread, and replaceable. That means, when process is slow, you can have many instances of processor and make full use of multiple CPUs, or deploy to multiple machines. This architecture makes pyspider really fast. benchmarking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Unrivaled Embedded Payments Solutions | NMI Icon
    Unrivaled Embedded Payments Solutions | NMI

    For SaaS builders, software companies, ISVs and ISOs who want to embed payments into their tech stack

    NMI Payments is an embedded payments solution that lets SaaS platforms, Software companies and ISVs integrate, brand, and manage payment acceptance directly within their software—without becoming a PayFac or building complex infrastructure. As a full-stack processor, acquirer, and technology partner, NMI handles onboarding, compliance, and risk so you can stay focused on growth. The modular, white-label platform supports omnichannel payments, from online, mobile and in-app to in-store and unattended. Choose from full-code, low-code, or no-code integration paths and launch in weeks, not months. Built-in risk tools, flexible monetization, and customizable branding help you scale faster while keeping full control of your experience. With NMI’s developer-first tools, sandbox testing, and modern APIs, you can embed payments quickly and confidently.
    Learn More
  • 10
    jsondata

    jsondata

    Modular JSON by trees and branches, pointers and patches

    The 'jsondata' package provides for the modular in-memory processing of JSON data by trees, branches, pointers, and patches. The main interface classes are: - JSONData - Core for RFC7159 based data structures. Provides modular data components. - JSONDataSerializer - Core for RFC7159 based data persistence. Provides modular data serialization. - JSONPointer - RFC6901 for addressing by pointer paths. Provides pointer arithmetics. - JSON Relative Pointer -...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    FlexibleParser_Java

    FlexibleParser_Java

    Group of multi-purpose Java-converted-from-C# parsing libraries.

    FlexibleParser (Java) is currently formed by the following independent libraries/parts: - UnitParser. It allows to easily deal with a wide variety of situations involving units of measurement. - NumberParser. It provides a common framework for all the .NET numeric types. AUTHORSHIP & COPYRIGHT I, Alvaro Carballo Garcia (varocarbas), am the sole author of each single bit of this code. All the contents of this repository can be considered public domain. For more information about my...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    FlexibleParser

    FlexibleParser

    FlexibleParser is a group of multi-purpose .NET parsing libraries.

    FlexibleParser (C#/.NET) is currently formed by the following independent libraries/parts: - UnitParser. It allows to easily deal with a wide variety of situations involving units of measurement. - NumberParser. It provides a common framework for all the .NET numeric types. - DateParser. It enhances the default .NET date/time support. AUTHORSHIP & COPYRIGHT I, Alvaro Carballo Garcia (varocarbas), am the sole author of each single bit of this code. All the contents of this...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    survol

    RDF-based framework monitoring business systems activity

    A Python agent and a web interface aiming to help the analysis and investigation of a legacy application. A set of machines, processes, databases, programs etc ... all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    **CODE MOVED TO GITHUB: https://github.com/bitextor ** Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    PyAIMLng

    The Next Generation of Python AIML Interpreter

    A Python AIML interpreter with non-compliant extensions. PyAIMLng is an interpreter for AIML (the Artificial Intelligence Markup Language), forked from Cort Stratton's PyAIML. PyAIMLng adds additional features which are not part of the AIML 1.0.1 specification in order to provide the bot master with a rich set of tools from which to build a more believable AIML bot.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    transitfeed

    Tools for reading, validating & writing transit schedule info in GTFS

    transitfeed helps to make public transit information projects more successful for agencies and other interested parties. It provides a library to help you parse, validate, and generate General Transit Feed Spec (GTFS) feed files. It contains several parts, including TransitFeed (Python package for reading, writing, and validating feeds); Feedvalidator (command line tool that checks a GTFS feed for problems); ScheduleViewer (application for exploring a feed on a map) and more; each with their...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FilteredPush
    Network software for annotation and quality control of distributed data. Supported by NSF: DBI 0646266 and NSF: DBI 0960535. http://wiki.filteredpush.org
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    IdeoType is a book compiler that converts manuscript (XHTML) to book (PDF) on the fly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    WikiSQL

    WikiSQL

    A large annotated semantic parsing corpus for developing NL interfaces

    A large crowd-sourced dataset for developing natural language interfaces for relational databases. WikiSQL is the dataset released along with our work Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. Regarding tokenization and Stanza, when WikiSQL was written 3-years ago, it relied on Stanza, a CoreNLP python wrapper that has since been deprecated. If you'd still like to use the tokenizer, please use the docker image. We do not anticipate switching...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20

    redfish-lab

    Get started with the Redfish RESTful API from the DMTF

    Redfish-lab allows a smooth ramp-up with the Redfish RESTful API on an HPE ProLiant server, including UEFI/BIOS configuration with various scripting languages like PowerShell and Python. Small tutorials/articles are also proposed in the Wiki section.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    pyCGNS

    pyCGNS

    A Python package for CGNS

    pyCGNS is now on github: https://pycgns.github.io/index.html https://github.com/pyCGNS/pyCGNS
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Extended Memory Semantics (EMS)

    Extended Memory Semantics (EMS)

    Persistent shared object memory and parallelism for Node.js and Python

    EMS makes possible persistent shared memory parallelism between Node.js, Python, and C/C++. Extended Memory Semantics (EMS) unifies synchronization and storage primitives to address several challenges of parallel programming. A modern multi-core server has 16-32 cores and nearly 1TB of memory, equivalent to an entire rack of systems from a few years ago. As a consequence, jobs formerly requiring a Map-Reduce cluster can now be performed entirely in shared memory on a single server without...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PyXB (“pixbee”) is a pure Python package that generates Python source code for classes that correspond to data structures defined by XMLSchema. In concept it is similar to JAXB for Java and CodeSynthesis XSD for C++.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 24

    EasyHTML

    A python package for building DOM of the HTML documents

    A python package that provides an easy access to elements of HTML and XHTML documents through the Document Object Model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    CoRoPa stands for Computational Rough Paths. The aim of CoRoPa is to provide a software framework for various ideas related to Rough Path Theory, including rough differential equations and the digital description of serial data streams.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB