Search Results for "python data analysis" - Page 16

Showing 1057 open source projects for "python data analysis"

Filters: Artificial Intelligence, Python

1

SGR Agent Core

Agentic system design based on Schema-Guided Reasoning (SGR)

SGR Agent Core is an open-source framework for building intelligent AI research agents based on a methodology known as Schema-Guided Reasoning (SGR). The framework provides a core library that allows developers to design autonomous agents capable of structured reasoning and complex task execution. Instead of relying solely on free-form prompts, the system organizes reasoning processes around schemas that guide how agents analyze problems, gather information, and generate outputs. This...

Downloads: 7 This Week

Last Update: 2026-03-18
2

Skill Scanner

Security Scanner for Agent Skills

This repository is a public security-focused scanning tool intended to analyze and assess AI agent skills for potential issues, quality concerns, and vulnerabilities. It acts as a scanner that inspects Agent Skills packages to flag structural problems, inconsistencies, or security flaws before they are deployed or integrated into agent workflows. Because agent skills can contain executable instructions and logic, scanning them for risky patterns is essential to prevent inadvertent...

Downloads: 7 This Week

Last Update: 6 days ago
3

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

...It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system, replacing YAML with more flexible, programmable configs that can be versioned and composed for different experiments. The framework builds on PyTorch Lightning–style modular abstractions, so training scripts are composed from reusable components for data loading, models, optimizers, and schedulers, which simplifies experimentation and adaptation. ...

Downloads: 2 This Week

Last Update: 2026-03-23
4

OpenDAN

OpenDAN is an open source Personal AI OS

OpenDAN is an open-source Personal AI OS that consolidates various AI modules in one place for your personal use. The goal of OpenDAN (Open and Do Anything Now with AI) is to create a Personal AI OS that provides a runtime environment for various AI modules as well as protocols for interoperability between them. With OpenDAN, users can securely collaborate with various AI modules using their private data to create powerful personal AI agents, such as butlers, lawyers, doctors, teachers,...

Downloads: 5 This Week

Last Update: 2024-08-12
5

Datasets

Hub of ready-to-use datasets for ML models

Datasets is a library for easily accessing and sharing datasets, and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads without any memory constraints for optimal speed and efficiency. We also feature a deep...

Downloads: 6 This Week

Last Update: 2026-03-23
6

TensorFlow

TensorFlow is an open source library for machine learning

Originally developed by Google for internal use, TensorFlow is an open source platform for machine learning. Available across all common operating systems (desktop, server and mobile), TensorFlow provides stable APIs for Python and C, as well as APIs for a variety of other languages that are either third-party or not guaranteed to be backwards compatible. The platform can be easily deployed on multiple CPUs, GPUs and Google's proprietary chip, the tensor processing unit (TPU). TensorFlow...

2 Reviews

Downloads: 12 This Week

Last Update: 2026-03-07
7

DINOv3

Reference PyTorch implementation and models for DINOv3

DINOv3 is the third-generation iteration of Meta’s self-supervised visual representation learning framework, building upon the ideas from DINO and DINOv2. It continues the paradigm of learning strong image representations without labels using teacher–student distillation, but introduces a simplified and more scalable training recipe that performs well across datasets and architectures. DINOv3 removes the need for complex augmentations or momentum encoders, streamlining the pipeline while...

Downloads: 24 This Week

Last Update: 2026-03-30
8

MetaScreener

AI-powered tool for efficient abstract and PDF screening

MetaScreener is an open-source AI-assisted tool designed to streamline the screening process in systematic literature reviews and academic research workflows. The system helps researchers analyze large collections of academic abstracts and research papers to determine which studies are relevant for inclusion in evidence synthesis projects. Instead of manually reviewing hundreds or thousands of documents, researchers can use MetaScreener to apply machine learning techniques that assist with...

Downloads: 4 This Week

Last Update: 2026-03-09
9

Llama Cookbook

Solve end-to-end problems using the Llama model family

The Llama Cookbook is the official Meta LLaMA guide for inference, fine‑tuning, RAG, and multi-step use cases. It offers recipes, code samples, and integration examples across provider platforms (WhatsApp, SQL, long context workflows), enabling developers to quickly harness LLaMA models.

Downloads: 0 This Week

Last Update: 2025-06-28
10

GPT Academic

Research-oriented chatbot framework

GPT Academic is a research-oriented chatbot framework designed to integrate large language models (LLMs) into academic workflows. It provides tools for structured document processing, citation management, and enhanced interaction with research papers.

Downloads: 0 This Week

Last Update: 2025-03-04
11

Prometheus-Eval

Evaluate your LLM's response with Prometheus and GPT4

...It implements an “LLM-as-a-judge” approach in which a dedicated language model analyzes instruction–response pairs and assigns scores or rankings based on predefined evaluation criteria. The repository includes a Python package that provides a straightforward interface for running evaluations and integrating them into model development pipelines. It also provides training data and utilities for fine-tuning evaluator models so they can assess outputs according to custom scoring rubrics such as helpfulness, accuracy, or style.

Downloads: 5 This Week

Last Update: 2026-03-09
12

slime LLM

slime is an LLM post-training framework for RL Scaling

slime is an open-source large language model (LLM) post-training framework developed to support reinforcement learning (RL)-based scaling and high-performance training workflows for advanced LLMs, blending training and rollout modules into an extensible system. It offers a flexible architecture that connects high-throughput training (e.g., via Megatron-LM) with a customizable data generation pipeline, enabling researchers and engineers to iterate on new RL training paradigms effectively. The...

Downloads: 7 This Week

Last Update: 2026-03-29
13

AudioCraft

AudioCraft is a library for audio processing and generation

...Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.

Downloads: 7 This Week

Last Update: 2025-10-13
14

TorchRec

PyTorch domain library for recommendation systems

TorchRec is a PyTorch domain library built to provide common sparsity & parallelism primitives needed for large-scale recommender systems (RecSys). It allows authors to train models with large embedding tables sharded across many GPUs. Its parallelism primitives enable easy authoring of large, performant multi-device/multi-node models using hybrid data-parallelism/model-parallelism. The TorchRec sharder can shard embedding tables with different sharding strategies including data-parallel,...

Downloads: 1 This Week

Last Update: 2026-03-15
15

Ultravox

Fast multimodal LLM for real-time voice interaction and AI apps

Ultravox is an open source multimodal large language model designed specifically for real-time voice-based interactions. It is built to process both text and spoken audio directly, eliminating the need for a separate speech recognition stage and enabling more seamless conversational experiences. Ultravox works by combining text prompts with encoded audio inputs, allowing it to understand spoken language alongside written instructions in a unified pipeline. Internally, it leverages pretrained...

Downloads: 6 This Week

Last Update: 2026-03-18
16

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with the ChatGPT Training Pipeline

MedicalGPT trains large medical GPT models using the ChatGPT training pipeline. It implements secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning.

Downloads: 9 This Week

Last Update: 3 days ago
17

Genv

GPU environment management and cluster orchestration

Genv is an open-source environment and cluster management system for GPUs. Genv lets you easily control, configure, monitor and enforce the GPU resources that you are using on a GPU machine or cluster. It is intended to ease the process of GPU allocation for data scientists without code changes.

Downloads: 8 This Week

Last Update: 2024-05-16
18

AutoClip

AI-powered video clipping and highlight generation

AutoClip is an open-source, AI-powered video processing system designed to automate the extraction of “highlight” segments from full-length videos — ideal for creators who want to generate bite-sized clips, compilations, or highlight reels without manually sifting through hours of footage. The system supports downloading videos from major platforms (e.g. YouTube, Bilibili), or accepting local uploads, and then applies AI analysis to identify segments worth clipping based on content (e.g....

Downloads: 17 This Week

Last Update: 2025-12-08
19

Denoising Diffusion Probabilistic Model

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Implementation of Denoising Diffusion Probabilistic Model in PyTorch. It is a new approach to generative modeling that may have the potential to rival GANs. It uses denoising score matching to estimate the gradient of the data distribution, followed by Langevin sampling to sample from the true distribution. If you simply want to pass in a folder name and the desired image dimensions, you can use the Trainer class to easily train a model.

Downloads: 6 This Week

Last Update: 2026-02-11
20

ChatterBot

Machine learning, conversational dialog engine for creating chat bots

ChatterBot is a Python library that makes it easy to generate automated responses to a user’s input. ChatterBot uses a selection of machine learning algorithms to produce different types of responses. This makes it easy for developers to create chat bots and automate conversations with users. For more details about the ideas and concepts behind ChatterBot, see the process flow diagram.

Downloads: 8 This Week

Last Update: 2026-03-24
21

autoresearch for AMD

AI agents running research on single-GPU nanochat training

autoresearch for AMD is a framework for autonomous scientific experimentation in machine learning, enabling AI agents to iteratively improve models through a continuous loop of hypothesis generation, experimentation, and evaluation. The system is built around a minimal structure that includes a data preparation module, a training script that can be modified, and a program specification that guides the agent’s decision-making process. During each iteration, the agent edits the training code,...

Downloads: 1 This Week

Last Update: 2026-03-30
22

Continuous Claude v3

Context management for Claude Code. Hooks maintain state via ledgers

Continuous Claude v3 is a persistent, multi-agent development environment built around the Claude Code CLI that aims to overcome the limitations of standard LLM context windows. Rather than relying on a single session’s context, Continuous Claude uses mechanisms like ledgers, YAML handoffs, and a memory system to preserve and recall state across multiple sessions, ensuring that learned insights and plans are not lost when context compaction occurs. The project orchestrates many specialized...

Downloads: 0 This Week

Last Update: 2026-01-30
23

LuxTTS

A high-quality rapid TTS voice cloning model

...It implements a lightweight architecture based on ZipVoice and optimized sampling techniques so that it can generate speech at speeds up to roughly 150 times real-time on a single GPU and faster than real-time on CPU, all while producing audio at high fidelity with 48 kHz quality. The project supports zero-shot voice cloning, meaning it can adapt to a reference speaker’s voice with minimal example data, enabling realistic and personalized synthetic speech. Intended for developers, hobbyists, and creators, the repository includes installation instructions, usage examples, and Python APIs that make it feasible to integrate the model in local workflows, web demos, or production systems. Its design emphasizes efficiency and practicality, fitting within modest GPU memory footprints.

Downloads: 4 This Week

Last Update: 2026-03-12
24

bbox-visualizer

Make drawing and labeling bounding boxes easy as cake

Make drawing and labeling bounding boxes easy as cake. This package helps users draw bounding boxes around objects, without doing the clumsy math that you'd need to do for positioning the labels. It also has a few different types of visualizations you can use for labeling objects after identifying them. There are optional functions that can draw multiple bounding boxes and/or write multiple labels on the same image, but it is advisable to use the above functions in a loop in order to have...

Downloads: 2 This Week

Last Update: 2026-01-29
25

talos

Hyperparameter Optimization for TensorFlow, Keras and PyTorch

Talos radically changes the ordinary Keras, TensorFlow (tf.keras), and PyTorch workflow by fully automating hyperparameter tuning and model evaluation. Talos fully exposes Keras, TensorFlow (tf.keras), and PyTorch functionality, and there is no new syntax or templates to learn. Talos is made for data scientists and data engineers who want to remain in complete control of their TensorFlow (tf.keras) and PyTorch models, but are tired of mindless parameter hopping and confusing...

Downloads: 2 This Week

Last Update: 2024-04-21