Showing 547 open source projects for "python data analysis"

View related business solutions
  • Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight Icon
    Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight

    Lock Down Any Resource, Anywhere, Anytime

    CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.
    Learn More
  • Data management solutions for confident marketing Icon
    Data management solutions for confident marketing

    For companies wanting a complete Data Management solution that is native to Salesforce

    Verify, deduplicate, manipulate, and assign records automatically to keep your CRM data accurate, complete, and ready for business.
    Learn More
  • 1
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts. For users, there are four applications offering many features. The project is hosted on Github and...
    Downloads: 109 This Week
    Last Update:
    See Project
  • 2
    UltraJSON

    UltraJSON

    Ultra fast JSON decoder and encoder written in C with Python bindings

    UltraJSON is an ultra-fast JSON encoder and decoder written in pure C with bindings for Python 3.7+. May be used as a drop-in replacement for most other JSON parsers for Python. Used to enable special encoding of "unsafe" HTML characters into safer Unicode sequences. Limits output to ASCII and escapes all extended characters above 127. Default is True. If your end format supports UTF-8, setting this option to false is highly recommended to save space. By default, debugging symbols are...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Miller

    Miller

    Miller is like awk, sed, cut, join, and sort for name-indexed data

    Miller is like awk, sed, cut, join, and sort for data formats such as CSV, TSV, JSON, JSON Lines, and positionally-indexed. With Miller, you get to use named fields without needing to count positional indices, using familiar formats such as CSV, TSV, JSON, JSON Lines, and positionally-indexed. Then, on the fly, you can add new fields which are functions of existing fields, drop fields, sort, aggregate statistically, pretty-print, and more. Miller operates on key-value-pair data while the...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 4
    Texify

    Texify

    Math OCR model that outputs LaTeX and markdown

    Texify is an OCR model that converts images or pdfs containing math into markdown and LaTeX that can be rendered by MathJax ($$ and $ are delimiters). It can run on CPU, GPU, or MPS.
    Downloads: 3 This Week
    Last Update:
    See Project
  • The AI workplace management platform Icon
    The AI workplace management platform

    Plan smart spaces, connect teams, manage assets, and get insights with the leading AI-powered operating system for the built world.

    By combining AI workflows, predictive intelligence, and automated insights, OfficeSpace gives leaders a complete view of how their spaces are used and how people work. Facilities, IT, HR, and Real Estate teams use OfficeSpace to optimize space utilization, enhance employee experience, and reduce portfolio costs with precision.
    Learn More
  • 5
    RenderCV

    RenderCV

    LaTeX CV generator from a YAML/JSON input file

    RenderCV is a LaTeX CV/resume framework. It allows you to create a high-quality CV as a PDF from a YAML file with full Markdown syntax support and complete control over the LaTeX code. RenderCV offers built-in LaTeX and Markdown templates ready to produce high-quality CVs. However, the templates are entirely arbitrary and can easily be updated to leverage RenderCV's capabilities with your custom CV themes.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 6
    mistletoe

    mistletoe

    A fast, extensible and spec-compliant Markdown parser in pure Python

    mistletoe is a Markdown parser in pure Python, designed to be fast, spec-compliant and fully customizable. Apart from being the fastest CommonMark-compliant Markdown parser implementation in pure Python, mistletoe also supports easy definitions of custom tokens. Parsing Markdown into an abstract syntax tree also allows us to swap out renderers for different output formats, without touching any of the core components.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    openvpn-monitor

    openvpn-monitor

    openvpn-monitor is a web based OpenVPN monitor

    openvpn-monitor is a simple Python program to generate HTML that displays the status of an OpenVPN server, including all current connections. It uses the OpenVPN management console. It typically runs on the same host as the OpenVPN server, however, it does not necessarily need to. OpenVPN-monitor is a web-based OpenVPN monitor, that shows current connection information, such as users, location, and data transferred.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Rapid LaTeX OCR

    Rapid LaTeX OCR

    Formula recognition based on LaTeX-OCR and ONNXRuntime

    Formula recognition based on LaTeX-OCR and ONNXRuntime. rapid_latex_ocr is a tool to convert formula images to latex format. The reasoning code in the repo is modified from LaTeX-OCR, the model has all been converted to ONNX format, and the reasoning code has been simplified, Inference is faster and easier to deploy. The repo only has codes based on ONNXRuntime or OpenVINO inference in onnx format and does not contain training model codes. If you want to train your own model, please move to...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    HTTPie CLI

    HTTPie CLI

    Modern, user-friendly command-line HTTP client for the API era

    HTTPie (pronounced aitch-tee-tee-pie) is a command-line HTTP client. Its goal is to make CLI interaction with web services as human-friendly as possible. HTTPie is designed for testing, debugging, and generally interacting with APIs & HTTP servers. The HTTP & HTTPS commands allow for creating and sending arbitrary HTTP requests. They use simple and natural syntax and provide formatted and colorized output.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Collect! is a highly configurable debt collection software Icon
    Collect! is a highly configurable debt collection software

    Everything that matters to debt collection, all in one solution.

    The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.
    Learn More
  • 10
    Nano PDF Editor

    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 11
    Connexion

    Connexion

    Swagger/OpenAPI First framework for Python on top of Flask

    Connexion is a framework on top of Flask that automagically handles HTTP requests defined using OpenAPI (formerly known as Swagger), supporting both v2.0 and v3.0 of the specification. Connexion allows you to write these specifications, then maps the endpoints to your Python functions. This is what makes it unique from other tools that generate the specification based on your Python code. You are free to describe your REST API with as much detail as you want and then Connexion guarantees...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Awkward Array

    Awkward Array

    Manipulate JSON-like data with NumPy-like idioms

    Awkward Array is a library for nested, variable-sized data, including arbitrary-length lists, records, mixed types, and missing data, using NumPy-like idioms. Arrays are dynamically typed, but operations on them are compiled and fast. Their behavior coincides with NumPy when array dimensions are regular and generalizes when they're not.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Remarshal

    Remarshal

    Convert between CBOR, JSON, MessagePack, TOML, and YAML

    Convert between CBOR, JSON, MessagePack, TOML, and YAML. When installed, provides the command-line command remarshal as well as the short commands {cbor,json,msgpack,toml,yaml}2{cbor,json,msgpack,toml,yaml}. You can perform format conversion, reformatting, and error detection using these commands. CBOR, MessagePack, and YAML with binary fields cannot be converted to JSON or TOML. Binary fields are converted between CBOR, MessagePack, and YAML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 15
    Markdown package LaTeX

    Markdown package LaTeX

    Package for converting and rendering markdown documents in TeX

    The Markdown package converts CommonMark markup to TeX commands. The functionality is provided both as a Lua module, and as plain TeX, LaTeX, and ConTeXt macro packages that can be used to directly typeset TeX documents containing markdown markup. Unlike other convertors, the Markdown package does not require any external programs and makes it easy to redefine how each and every markdown element is rendered. Creative abuse of the markdown syntax is encouraged.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    PackageDev

    PackageDev

    Tools to ease the creation of snippets, syntax definitions, etc.

    PackageDev provides syntax highlighting and other helpful utility for Sublime Text resource files. Resource files are ways of configuring the Sublime Text text editor to various extends, including but not limited to: custom syntax definitions, context menus (and the main menu), and key bindings. Thus, this package is ideal for package developers, but even normal users of Sublime Text who want to configure it to their liking should find it very useful.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    WriteTeX

    WriteTeX

    An Inkscape extension: Latex/Tex editor for Inkscape

    Due to an incompatible change of the Inkscape extension API, this extension has to split into two versions. For Inkscape versions lower than 1.0, users should use the files in the 0.9.x folder, the other users should use files in the 1.0.x folder.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    TexSoup

    TexSoup

    Fault-tolerant Python3 package for searching LaTeX documents

    Navigate, Search, and Modify LaTeX Documents in Python. Easy and reliable: No C extensions, no installation dependencies, and 100% test coverage. TexSoup is a fault-tolerant, Python3 package for searching, navigating, and modifying LaTeX documents. You can skip installation and try TexSoup directly, using the pytwiddle demo.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PSRCHIVE is an application suite and development library for use in the storage and analysis of pulsar astronomical data. It includes an extensive range of algorithms and tools commonly used in a wide variety of experiments.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 20
    Infer

    Infer

    A static analyzer for Java, C, C++, and Objective-C

    Infer is a static analysis tool - if you give Infer some Java or C/C++/Objective-C code it produces a list of potential bugs. Anyone can use Infer to intercept critical bugs before they have shipped to users, and help prevent crashes or poor performance. Infer checks for null pointer exceptions, resource leaks, annotation reachability, missing lock guards, and concurrency race conditions in Android and Java code. Infer checks for null pointer dereferences, memory leaks, coding conventions...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    tqdm

    tqdm

    A Fast, Extensible Progress Bar for Python and CLI

    tqdm is a fast, extensible progress bar for Python and CLI that enables you to see the progress of your loops in a clear and smart way. Simply wrap any iterable with tqdm(iterable), and sit back and watch that progress meter go! tqdm can be wrapped around any iterable, or executed as a module with pipes. Just by inserting tqdm (or python -m tqdm) between pipes will pass through all stdin to stdout while printing progress to stderr. tqdm does not require any dependencies, has a very...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    DocArray

    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Vanilla.PDF

    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures. Vanilla.PDF supports advanced...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    SAFRS

    SAFRS

    SqlAlchemy Flask-Restful Swagger Json:API OpenAPI

    SAFRS exposes SQLAlchemy database models as a JSON:API web service and generates the corresponding swagger/OpenAPI. Database objects such as the User class from the demo.py example can be extended to include relationships with other objects. The demo_relationship.py contains the following extension of the User class where a relationship with the Book class is implemented.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Docker Compose

    Docker Compose

    Define and run multi-container applications with Docker

    Docker Compose is an open source tool for defining and running multi-container applications with Docker. Compose lets you use a YAML file to configure your application’s services, and then create and start all the services from your configuration with just a single command. Compose works great in all environments: production, staging, testing, development, and on CI workflows. Compose has commands for every stage of your application lifecycle, from starting, stopping and rebuilding services,...
    Downloads: 21 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB