Showing 286 open source projects for "python data analysis"

View related business solutions
  • Easy management of simple and complex projects Icon
    Easy management of simple and complex projects

    We help different businesses become digital, manage projects, teams, communicate effectively and control tasks online.

    Plan more projects with Worksection. Use Gantt chart and Kanban boards to organize your projects, get your team onboard and assign tasks and due dates.
    Learn More
  • Endpoint Protection Software for Businesses | HYPERSECURE Icon
    Endpoint Protection Software for Businesses | HYPERSECURE

    DriveLock protects systems, data, end devices from data loss and misuse.

    The HYPERSECURE endpoint protection platform is a comprehensive suite of products and services enhanced by European third-party solutions. It ensures our customers’ IT security, regulatory compliance, and digital sovereignty.
    Learn More
  • 1
    ECharts

    ECharts

    A powerful, interactive charting and visualization library for browser

    ECharts is a free and open source charting and visualization library that gives you an easy way to add interactive, intuitive, custom charts to your commercial products, projects, presentations and more. It offers a rich set of features that includes rendering ability for ten-million-level data, Wechart and Powerpoint support, multi-dimension data analysis, and more. It also has a number of extensions for various applications. ECharts is written in pure JavaScript, and is based on zrender, a new and lightweight canvas library.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Matplotlib

    Matplotlib

    matplotlib: plotting with Python

    Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. Matplotlib makes easy things easy and hard things possible. Matplotlib ships with several add-on toolkits, including 3D plotting with mplot3d, axes helpers in axes_grid1 and axis helpers in axisartist. A large number of third party packages extend and build on Matplotlib functionality, including several higher-level plotting interfaces (seaborn, HoloViews, ggplot, ...), and a...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 3
    Graphene

    Graphene

    GraphQL in Python Made Easy

    Graphene is a Python library for building GraphQL APIs fast and easily, using a code-first approach. Instead of writing GraphQL Schema Definition Langauge (SDL), Python code is written to describe the data provided by your server. Graphene helps you use GraphQL effortlessly in Python, but what is GraphQL? GraphQL is a data query language developed internally by Facebook as an alternative to REST and ad-hoc webservice architectures.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Mimesis

    Mimesis

    High-performance fake data generator for Python

    Mimesis is an open source high-performance fake data generator for Python, able to provide data for various purposes in various languages. It's currently the fastest fake data generator for Python, and supports many different data providers that can produce data related to people, food, transportation, internet and many more. Mimesis is really easy to use, with everything you need just an import away.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Solve Your Real-world Business Challenges From A Single Timesheet With Powerful Built In Integration Icon
    Solve Your Real-world Business Challenges From A Single Timesheet With Powerful Built In Integration

    Timecontrol Is Your Multipurpose Timesheet Software

    TimeControl is a multi-purpose timesheet system designed to serve both Finance and Project Management. TimeControl has been designed to serve many purposes simultaneously. TimeControl tracks time on a task-by-task, project-by-project basis. Yet, despite its project-based controls, it remains a financial timesheet with all the controls necessary to fulfill the stringent needs of payroll, human resources, billing and finance. TimeControl is available both for subscription in the cloud or for purchase for an on premise installation and includes both a browser interface and the free TimeControl Mobile App for iOS and Android devices.
    Learn More
  • 5
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    cuDF

    cuDF

    GPU DataFrame Library

    Built based on the Apache Arrow columnar memory format, cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data. cuDF provides a pandas-like API that will be familiar to data engineers & data scientists, so they can use it to easily accelerate their workflows without going into the details of CUDA programming. For additional examples, browse our complete API documentation, or check out our more detailed notebooks. cuDF can be installed with conda (miniconda, or the full Anaconda distribution) from the rapidsai channel. cuDF is supported only on Linux, and with Python versions 3.7 and later. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    JC

    JC

    CLI tool and python library

    ...The JC parsers can also be used as python modules. In this case, the output will be a python dictionary, or a list of dictionaries, instead of JSON. Two representations of the data are available. The default representation uses a strict schema per parser and converts known numbers to int/float JSON values. Certain known values of None are converted to JSON null, known boolean values are converted, and, in some cases, additional semantic context fields are added.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Peroxide

    Peroxide

    Rust numeric library with high performance and friendly syntax

    Rust numeric library contains linear algebra, numerical analysis, statistics and machine learning tools with R, MATLAB, Python-like macros. Peroxide uses a 1D data structure to represent matrices, making it straightforward to integrate with BLAS (Basic Linear Algebra Subprograms). This means that Peroxide can guarantee excellent performance for linear algebraic computations by leveraging the optimized routines provided by BLAS.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Metaflow

    Metaflow

    A framework for real-life data science

    Metaflow is a human-friendly Python library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Tremendous is the global payouts platform for businesses sending gift cards and money at scale. Icon
    Tremendous is the global payouts platform for businesses sending gift cards and money at scale.

    Getting started is simple: add a funding method and place your first order in minutes.

    Trusted by 20,000+ leading organizations, Tremendous has delivered billions of rewards and enables businesses to reach recipients across 230+ countries and regions. Recipients have 2,500+ payout options to choose from, including gift cards, prepaid cards, cash transfers, and charitable donations.
    Learn More
  • 10
    dplyr

    dplyr

    dplyr: A grammar of data manipulation

    dplyr is an R package that provides a consistent and intuitive grammar for data manipulation, enabling users to filter, arrange, summarize, and transform data efficiently. Part of the tidyverse ecosystem, dplyr simplifies complex data operations through a clear and readable syntax, whether working with data frames, tibbles, or databases. It is widely used in data science and statistical analysis workflows.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Yoast WordPress SEO

    Yoast WordPress SEO

    Yoast SEO for WordPress

    The Yoast SEO plugin is the most popular SEO tool for WordPress, offering comprehensive tools to optimize content for search engines. It provides real-time page analysis, readability checks, and automated metadata handling to improve website visibility.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    Deequ

    Deequ

    Deequ is a library built on top of Apache Spark

    ...It also includes a little domain-specific language called DQDL (Data Quality Definition Language) which allows declarative specification of quality rules. Users typically run Deequ before feeding data downstream (to ML pipelines, analytics, or production systems), enabling early detection and isolation of data errors. There is also a Python wrapper, PyDeequ, for users who prefer working from Python environments.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Behaviour Suite Reinforcement Learning

    Behaviour Suite Reinforcement Learning

    bsuite is a collection of carefully-designed experiments

    bsuite is a research framework developed by Google DeepMind that provides a comprehensive collection of experiments for evaluating the core capabilities of reinforcement learning (RL) agents. Its main goal is to identify, measure, and analyze fundamental aspects of learning efficiency and generalization in RL algorithms. The library enables researchers to benchmark their agents on standardized tasks, facilitating reproducible and transparent comparisons across different approaches. Each...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Parsera

    Parsera

    Lightweight library for scraping web-sites with LLMs

    Scrape data from any website with only a link and column descriptions. Parsera is a tool designed to scrape web content, specifically handling poorly structured or messy websites.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    redis-py

    redis-py

    Redis Python client

    redis-py is the official Python client for interacting with Redis, the in-memory data structure store. It supports all Redis commands and data types, making it easy to build caching, messaging, or real-time analytics features in Python applications. With both synchronous and asyncio support, redis-py is suited for modern Python projects and integrates smoothly into web frameworks, task queues, and backend services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PyExcelerate

    PyExcelerate

    Accelerated Excel XLSX Writing Library for Python 2/3

    Accelerated Excel XLSX writing library for Python. PyExcelerate is a Python for writing Excel-compatible XLSX spreadsheet files, with an emphasis on speed.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Pysheeet

    Pysheeet

    Python Cheat Sheet

    Pysheeet is a community-driven collection of Python code snippets covering common patterns and tasks like sockets, file I/O, data structures, and more. Each snippet is concise and battle-tested, designed to save coding time and reduce boilerplate. With documentation hosted on Read the Docs and an active GitHub repo, it’s a go-to resource for Python developers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Pants Build System

    Pants Build System

    The Pants Build System

    Pants 2 is a fast, scalable, user-friendly build system for codebases of all sizes. It's currently focused on Python, Go, Java, Scala, Kotlin, Shell, and Docker, with support for other languages and frameworks coming soon. A lot of effort has gone into making Pants easy to adopt, easy to use and easy to extend. We're super excited to bring Pants' distinctive features to Go, Java, Python, Scala, Kotlin, and Shell users. Pants requires very minimal BUILD file metadata/boilerplate. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 19
    PyPDF

    PyPDF

    A pure-python PDF library capable of splitting, merging, cropping

    pypdf is a pure Python library for working with PDF files, allowing developers to split, merge, rotate, encrypt, and extract content from PDFs. It’s an actively maintained fork of PyPDF2, improving performance, compatibility, and support for modern PDF standards. Suitable for both automation scripts and full-featured applications, pypdf handles PDFs without requiring external dependencies.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    Awesome Fraud Detection Research Papers

    Awesome Fraud Detection Research Papers

    A curated list of data mining papers about fraud detection

    A curated list of data mining papers about fraud detection from several conferences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Open3D

    Open3D

    A modern library for 3D data processing

    Open3D is an open-source library that supports rapid development of software that deals with 3D data. The Open3D frontend exposes a set of carefully selected data structures and algorithms in both C++ and Python. The backend is highly optimized and is set up for parallelization. Open3D was developed from a clean slate with a small and carefully considered set of dependencies. It can be set up on different platforms and compiled from source with minimal effort. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22
    django-import-export

    django-import-export

    Django application and library for importing and exporting data

    django-import-export is a Django application and library for importing and exporting data with included admin integration. Support multiple formats (Excel, CSV, JSON, and everything else that tablib supports) Admin integration for importing. Preview import changes. Admin integration for exporting. Export data respecting admin filters. By default all records will be imported, even if no changes are detected. This can be changed setting the skip_unchanged option. Also, the report_skipped...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    DynamicalSystems.jl

    DynamicalSystems.jl

    Award winning software library for nonlinear dynamics timeseries

    DynamicalSystems.jl is an award-winning Julia software library for nonlinear dynamics and nonlinear time series analysis. To install DynamicalSystems.jl, run import Pkg; Pkg.add("DynamicalSystems"). To learn how to use it and see its contents visit the documentation, which you can either find online or build locally by running the docs/make.jl file. DynamicalSystems.jl is part of JuliaDynamics, an organization dedicated to creating high-quality scientific software. All implemented algorithms...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    Easy3D

    Easy3D

    Efficient library for processing 3D data

    Easy3D is a lightweight, easy-to-use, and efficient library for processing and rendering 3D data, implemented in C++ with Python bindings. It is designed for tasks such as 3D modeling, geometry processing, and rendering, emphasizing simplicity and efficiency. Easy3D serves as a valuable tool for research, education, and the development of sophisticated 3D applications, providing a solid foundation for handling 3D data.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    OpenTelemetry Collector distributions

    OpenTelemetry Collector distributions

    OpenTelemetry Collector Official Releases

    High-quality, ubiquitous, and portable telemetry to enable effective observability. OpenTelemetry is a collection of APIs, SDKs, and tools. Use it to instrument, generate, collect, and export telemetry data (metrics, logs, and traces) to help you analyze your software’s performance and behavior. Create and collect telemetry from your services and software, then forward it to a variety of analysis tools. OpenTelemetry integrates with many popular libraries and frameworks, and supports code-based and zero-code instrumentation.
    Downloads: 99 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB