Showing 40 open source projects for "python data analysis"

View related business solutions
  • Network Discovery Software | JDisc Discovery Icon
    Network Discovery Software | JDisc Discovery

    JDisc Discovery supports the IT organizationss of medium-sized businesses and large-scale enterprises.

    JDisc Discovery is a comprehensive network inventory and IT asset management solution designed to help organizations gain clear, up-to-date visibility into their IT environment. It automatically scans and maps devices across the network, including servers, workstations, virtual machines, and network hardware, to create a detailed inventory of all connected assets. This includes critical information such as hardware configurations, software installations, patch levels, and relationshipots between devices.
    Learn More
  • RouteGenie NEMT software Icon
    RouteGenie NEMT software

    Modern software for non-emergency medical transportation providers, built to improve scheduling, billing, routing, and dispatching processes.

    RouteGenie NEMT software is a modern system built to automate all non-emergency medical transportation processes including routing, scheduling, dispatching, and billing. It helps manage everyday challenges like vehicle breakdowns, traffic problems, cancelations, driver call-offs, will calls, no shows, add-on trips, on-demand trips, and more.
    Learn More
  • 1
    AutoResearchClaw

    AutoResearchClaw

    Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞

    AutoResearchClaw is an open-source framework designed to automatically generate full academic research papers from a single idea or topic. Built in Python, it orchestrates a multi-stage research pipeline that gathers literature, formulates hypotheses, runs experiments, analyzes results, and writes the final paper. The system retrieves real academic references from sources such as arXiv and Semantic Scholar to ensure credible citations. It can automatically generate code for experiments, run...
    Downloads: 32 This Week
    Last Update:
    See Project
  • 2
    nanoGPT

    nanoGPT

    The simplest, fastest repository for training/finetuning models

    NanoGPT is a minimalistic yet powerful reimplementation of GPT-style transformers created by Andrej Karpathy for educational and research use. It distills the GPT architecture into a few hundred lines of Python code, making it far easier to understand than large, production-scale implementations. The repo is organized with a training pipeline (dataset preprocessing, model definition, optimizer, training loop) and inference script so you can train a small GPT on text datasets like Shakespeare...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Megatron-LM

    Megatron-LM

    Ongoing research training transformer models at scale

    Megatron-LM is a GPU-optimized deep learning framework from NVIDIA designed to train extremely large transformer-based language models efficiently at scale. The repository provides both a reference training implementation and Megatron Core, a composable library of high-performance building blocks for custom large-model pipelines. It supports advanced parallelism strategies including tensor, pipeline, data, expert, and context parallelism, enabling training across massive multi-GPU and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    AI Researcher

    AI Researcher

    An autonomous AI researcher

    AI Researcher is an experimental open-source project that demonstrates how multiple AI agents can collaborate to conduct complex research tasks from start to finish with minimal human intervention. It orchestrates agents that can generate research questions, perform literature reviews, execute experiments, analyze results, and synthesize findings into structured outputs like reports or code. Each agent operates with clear roles — such as researcher, analyst, and summarizer — and they...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud. Icon
    Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud.

    Power faster innovation with Teradata VantageCloud

    VantageCloud is the complete cloud analytics and data platform, delivering harmonized data and Trusted AI for all. Built for performance, flexibility, and openness, VantageCloud enables organizations to unify diverse data sources, run complex analytics, and deploy AI models—all within a single, scalable platform.
    Learn More
  • 5
    JSONLab

    JSONLab

    JSONLab: compact, portable, robust JSON/binary-JSON encoder

    JSONLab is a free and open-source JSON/UBJSON/MessagePack encoder and decoder written in the native MATLAB language. It can be used to convert a MATLAB data structure (array, struct, cell, struct array, cell array, and objects) into JSON/UBJSON/MessagePack formatted strings and files, or to parse a JSON/UBJSON/MessagePack file into MATLAB data structure. JSONLab supports nearly all versions of MATLAB and GNU Octave (a free MATLAB clone). The development of JSONLab is currently funded by the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    xrayutilities

    xrayutilities

    a package with useful scripts for X-ray diffraction physicists

    xrayutilities is a python package used to analyze x-ray diffraction data. It can support with performing diffraction experiments and used for common steps in the data analysis. It can read experimental data from several data formats (spec, edf, xrdml, ...); convert them to reciprocal space for arbitrary goniometer geometries and different detector systems (point, linear as well as area detectors); for further processing the data can be gridded (transformed to a regular grid). ...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 7
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    CiteSpace

    CiteSpace

    A widely used tool for visual exploration of scientific literature.

    Visit the new site: https://citespace.podia.com CiteSpace generates interactive visualizations of structural and temporal patterns and trends of a scientific field. It facilitates a systematic review of a knowledge domain through an in-depth visual analytic process. It can process citation data from popular sources such as the Web of Science, Scopus, Dimensions, and the Lens. CiteSpace also supports basic visual analytic functions for datasets without citation-related information, for...
    Leader badge
    Downloads: 2,227 This Week
    Last Update:
    See Project
  • The CI/CD Platform built for Mobile DevOps Icon
    The CI/CD Platform built for Mobile DevOps

    For mobile app developers interested in a powerful CI/CD platform for mobile app development and mobile DevOps

    Save time, money, and developer frustration with fast, flexible, and scalable mobile CI/CD that just works. Whether you swear by native or would rather go cross-platform, we have you covered. From Swift to Objective-C, Java to Kotlin, as well as Xamarin, Cordova, Ionic, React Native, and Flutter: Whatever you choose, we will automatically configure your initial workflows and have you building in minutes.
    Learn More
  • 10
    Artifact GeoMorph Toolbox 3D 3.1

    Artifact GeoMorph Toolbox 3D 3.1

    A toolbox for 3DGM shape analysis of archaeological artifacts

    The Artifact Geomorph Toolbox 3D software is designed to provide the archaeologist interested in artifact shape variability with a toolbox to allow the acquisition, analysis and results exploration of homologous 3D landmark-based geometric morphometric data. As such, the toolbox contains an automated item and semi-landmarks positioning procedure and the fundamental statistical analyses and procedures to allow the processing and analysis of the data. It is designed to be easy to use and straightforward with a simple graphical user interface. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 11
    ACORBA

    ACORBA

    Automated approach to measure root tip angles of Arabidopsis thaliana

    Gravitropic response is studied in most of the laboratories working with Arabidopsis thaliana, for example, to detect new phenotypes in mutants. However, manual analysis of images and microscopy data are known to be subjected to human bias. This is particularly the case for manual measurements of root bending as the angle is set subjectively. In this context, it is essential to develop and use automated or semi-automated image analysis to produce faster, reproducible, and unbiased data. In this context, we developped ACORBA (Automatic Calculation Of Root Bending Angles), a fully automated software to measure root bending angle over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DIG

    DIG

    A library for graph deep learning research

    The key difference with current graph deep learning libraries, such as PyTorch Geometric (PyG) and Deep Graph Library (DGL), is that, while PyG and DGL support basic graph deep learning operations, DIG provides a unified testbed for higher level, research-oriented graph deep learning tasks, such as graph generation, self-supervised learning, explainability, 3D graphs, and graph out-of-distribution. If you are working or plan to work on research in graph deep learning, DIG enables you to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    AllenNLP

    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Brain Tokyo Workshop

    Brain Tokyo Workshop

    Experiments and code from Google Brain’s Tokyo research workshop

    The Brain Tokyo Workshop repository hosts a collection of research materials and experimental code developed by the Google Brain team based in Tokyo. It showcases a variety of cutting-edge projects in artificial intelligence, particularly in the areas of neuroevolution, reinforcement learning, and model interpretability. Each project explores innovative approaches to learning, prediction, and creativity in neural networks, often through unconventional or biologically inspired methods. The...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Catalyst

    Catalyst

    Accelerated deep learning R&D

    Catalyst is a PyTorch framework for accelerated Deep Learning research and development. It allows you to write compact but full-featured Deep Learning pipelines with just a few lines of code. With Catalyst you get a full set of features including a training loop with metrics, model checkpointing and more, all without the boilerplate. Catalyst is focused on reproducibility, rapid experimentation, and codebase reuse so you can break the cycle of writing another regular train loop and make...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    PS-Drone

    PS-Drone

    Programming a Parrot AR.Drone 2.0 with Python - The Easy Way

    The PS-Drone-API is a full featured SDK, written in and for Python, for Parrot's AR.Drone 2.0. It was designed to be easy to learn, but it offers the full set of the possibilities of the AR.Drone 2.0, including Sensor-Data (aka NavData), Configuration and full Video-support. The video function is not restricted to mere viewing, it is also possible to analyze video images data using OpenCV2.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    Ceka

    Ceka

    Crowd Environment and its Knowledge Analysis

    A knowledge analysis tool for crowdsourcing based on Weka. We also have a Python version of Crowdsourcing Learning: CrowdwiseKit on GitHub (https://github.com/tssai-lab/CrowdwiseKit).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    wav2letter++

    wav2letter++

    Facebook AI research's automatic speech recognition toolkit

    ...At least one of LZMA, BZip2, or Z is required for LM compression with KenLM. It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable python compatibility. After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step.wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Data Science at the Command Line

    Data Science at the Command Line

    Data science at the command line

    ...To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux. You’ll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you’re comfortable processing data with Python or R, you’ll learn how to greatly improve your data science workflow by leveraging the command line’s power.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This project is intended to provide code to be used with MySQL and Python to create a database of major league baseball game events which are freely provided by the mlb.com Gameday application. Older version also support creating a retrosheet.org database but that is no longer supported. All major and minor league pitch location and game statistic data can be downloaded using BBOS. Installation Videos!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    NiftyNet

    NiftyNet

    An open-source convolutional neural networks platform for research

    ...Adapt existing networks to your imaging data. Quickly build new solutions to your own image analysis problems. NiftyNet currently supports medical image segmentation and generative adversarial networks. NiftyNet is not intended for clinical use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Pydicom by examples

    Pydicom by examples

    Basic and intermediate examples of DICOM library with Jupyter

    Basic and intermediate examples to read, modify and write DICOM files with Python code using Jupyter - To install Jupyter - https://jupyter.org/install ====== All examples are based on Pydicom. An open source library - https://pydicom.github.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Deep Learning for Medical Applications

    Deep Learning for Medical Applications

    Deep Learning Papers on Medical Image Analysis

    Deep-Learning-for-Medical-Applications is a repository that compiles deep learning methods, code implementations, and examples applied to medical imaging and healthcare data. The project addresses domain-specific challenges like segmentation, classification, detection, and multimodal data (e.g. MRI, CT, X-ray) using state-of-the-art architectures (e.g. U-Net, ResNet, GAN variants) tailored to medical constraints (small datasets, annotation costs, class imbalance). It includes Jupyter...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    E-Reader

    Extreem Reader for Cosmological simulation data P-Gadet2 and ART codes

    Extreem Reader for Cosmological simulation data P-Gadet2 and ART codes.It conserves memory by using memory mapped files. Contains some basic analysis tools such a KD-Tree, Minimum Spanning Tree, and rendering module using PMViewer algorithm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    TextTools
    TextTools is a freeware corpus linguistics tool developed in Python to aid in research. This program analyzes user-created corpora and displays information about word (token) frequency, n-grams, clusters, collocations, keyword in context (KWIC), and keyness. TextTools is designed to be user-friendly and intuitive and will run natively on Mac OS X.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB