Search Results for "python data analysis" - Page 4

Showing 182 open source projects for "python data analysis"

1

MegaParse

File Parser optimised for LLM Ingestion with no loss

MegaParse is a file parser optimized for Large Language Model (LLM) ingestion, ensuring no loss of information. It efficiently parses various document formats, such as PDFs, DOCX, and PPTX, converting them into formats ideal for processing by LLMs. This tool is essential for applications that require accurate and comprehensive data extraction from diverse document types.

Downloads: 0 This Week

Last Update: 2025-02-14
See Project
2

Deep Lake

Data Lake for Deep Learning. Build, manage, and query datasets

Deep Lake (formerly known as Activeloop Hub) is a data lake for deep learning applications. Our open-source dataset format is optimized for rapid streaming and querying of data while training models at scale, and it includes a simple API for creating, storing, and collaborating on AI datasets of any size. It can be deployed locally or in the cloud, and it enables you to store all of your data in one place, ranging from simple annotations to large videos. Deep Lake is used by Google, Waymo,...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
3

MetaScreener

AI-powered tool for efficient abstract and PDF screening

MetaScreener is an open-source AI-assisted tool designed to streamline the screening process in systematic literature reviews and academic research workflows. The system helps researchers analyze large collections of academic abstracts and research papers to determine which studies are relevant for inclusion in evidence synthesis projects. Instead of manually reviewing hundreds or thousands of documents, researchers can use MetaScreener to apply machine learning techniques that assist with...

Downloads: 2 This Week

Last Update: 2026-03-09
See Project
4

Prometheus-Eval

Evaluate your LLM's response with Prometheus and GPT4

...It implements an “LLM-as-a-judge” approach in which a dedicated language model analyzes instruction–response pairs and assigns scores or rankings based on predefined evaluation criteria. The repository includes a Python package that provides a straightforward interface for running evaluations and integrating them into model development pipelines. It also provides training data and utilities for fine-tuning evaluator models so they can assess outputs according to custom scoring rubrics such as helpfulness, accuracy, or style.

Downloads: 2 This Week

Last Update: 2026-03-09
See Project
5

OpenLLMetry

Open-source observability for your LLM application

The repo contains standard OpenTelemetry instrumentations for LLM providers and Vector DBs, as well as a Traceloop SDK that makes it easy to get started with OpenLLMetry, while still outputting standard OpenTelemetry data that can be connected to your observability stack. If you already have OpenTelemetry instrumented, you can just add any of our instrumentations directly.

Downloads: 2 This Week

Last Update: 11 hours ago
See Project
6

TTRL

Test-Time Reinforcement Learning

TTRL is an open-source framework for test-time reinforcement learning in large language models, with a particular focus on reasoning tasks where ground-truth labels are not available during inference. The project addresses the problem of how to generate useful reward signals from unlabeled test-time data, and its central insight is that common test-time scaling practices such as majority voting can be repurposed into reward estimates for online reinforcement learning. This makes the...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
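
The majority-voting idea in the TTRL description can be illustrated with a minimal sketch. This is not TTRL's actual code: the function name `majority_vote_rewards` and the binary 1.0/0.0 reward are assumptions made for illustration. Several answers are sampled for one unlabeled prompt, the most common answer is treated as a pseudo-label, and each sample is rewarded for agreeing with it.

```python
from collections import Counter


def majority_vote_rewards(answers):
    """Hypothetical helper: turn sampled answers into (pseudo_label, rewards).

    The most frequent answer stands in for the missing ground-truth label,
    and each sample gets reward 1.0 if it matches that answer, else 0.0.
    """
    if not answers:
        raise ValueError("need at least one sampled answer")
    # most_common(1) returns the single (answer, count) pair with the top count
    pseudo_label, _ = Counter(answers).most_common(1)[0]
    rewards = [1.0 if a == pseudo_label else 0.0 for a in answers]
    return pseudo_label, rewards


# Five sampled answers to the same unlabeled question
label, rewards = majority_vote_rewards(["42", "41", "42", "42", "7"])
print(label, rewards)
```

In a reinforcement learning loop these rewards would stand in for label-based rewards. How the project weights or filters them is not stated in the listing.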
7

RecAI

Bridging LLM and Recommender System

RecAI is an open-source research platform developed by Microsoft to explore how large language models can be integrated into modern recommender systems. Traditional recommender systems rely on structured behavioral data such as user interactions and item embeddings, while large language models excel at understanding language and reasoning about user preferences. RecAI aims to bridge these two domains by creating architectures and training methods that allow LLMs to function as intelligent...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
8

OM1

Modular AI runtime for robots

OM1 is an open-source AI platform designed to build autonomous agents capable of interacting with digital environments and completing complex tasks. The project focuses on creating a modular architecture where language models can coordinate with external tools, APIs, and knowledge sources to accomplish multi-step objectives. Instead of operating as simple conversational systems, OM1 agents can plan actions, retrieve information, and execute tasks across different services. The framework...

Downloads: 9 This Week

Last Update: 6 days ago
See Project
9

LLaMA-Mesh

Unifying 3D Mesh Generation with Language Models

LLaMA-Mesh is a research framework that extends large language models so they can understand and generate 3D mesh data alongside text. The system introduces a method for representing 3D meshes in a textual format by encoding vertex coordinates and face definitions as sequences that can be processed by a language model. By serializing 3D geometry into text tokens, the approach allows existing transformer architectures to generate and interpret 3D models without requiring specialized visual...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
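
As a rough illustration of serializing a mesh into text, the sketch below writes vertices and faces as OBJ-style lines. The function name `mesh_to_text`, the OBJ-like `v`/`f` layout, and the 64-bin integer quantization are assumptions for this example. The listing only says that vertex coordinates and face definitions are encoded as sequences.

```python
def mesh_to_text(vertices, faces, bins=64):
    """Hypothetical serializer: mesh -> plain text, one element per line.

    vertices: list of (x, y, z) floats
    faces: list of tuples of 0-based vertex indices
    Coordinates are quantized to integers in [0, bins - 1] so that each
    number becomes a short, predictable token sequence.
    """
    coords = [c for v in vertices for c in v]
    lo, hi = min(coords), max(coords)
    scale = (bins - 1) / (hi - lo) if hi > lo else 0.0
    lines = []
    for v in vertices:
        quantized = [round((c - lo) * scale) for c in v]
        lines.append("v " + " ".join(str(q) for q in quantized))
    for f in faces:
        # OBJ-style face indices are 1-based
        lines.append("f " + " ".join(str(i + 1) for i in f))
    return "\n".join(lines)


# A unit tetrahedron: 4 vertices, 4 triangular faces
tetra_vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
tetra_faces = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]
print(mesh_to_text(tetra_vertices, tetra_faces))
```

The resulting string can be tokenized like any other text, which is what allows a standard transformer to read or emit geometry without a separate mesh encoder.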
10

Hephaestus

Semi-Structured Agentic Framework. Workflows build themselves

Hephaestus is an open-source semi-structured agentic framework designed to orchestrate multiple AI agents working together on complex tasks. Instead of relying entirely on predefined workflows, the framework allows agents to dynamically create tasks as they explore a problem space. Developers define high-level phases such as analysis, implementation, and testing, while agents generate specific subtasks within those phases. The system continuously monitors agent behavior and task progression,...

Downloads: 1 This Week

Last Update: 2026-03-15
See Project
11

SGR Agent Core

Schema-Guided Reasoning (SGR) has agentic system design

SGR Agent Core is an open-source framework for building intelligent AI research agents based on a methodology known as Schema-Guided Reasoning (SGR). The framework provides a core library that allows developers to design autonomous agents capable of structured reasoning and complex task execution. Instead of relying solely on free-form prompts, the system organizes reasoning processes around schemas that guide how agents analyze problems, gather information, and generate outputs. This...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
12

trench

Open-Source Analytics Infrastructure

Trench is an open-source analytics infrastructure designed for tracking events and performing real-time analysis of application data at scale. The system is built on top of high-performance data technologies including Apache Kafka and ClickHouse, which allows it to ingest and process very large volumes of events while maintaining fast query performance. It was originally developed to solve scaling challenges in product analytics systems where traditional relational databases become inefficient as event tables grow. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
13

Agents 2.0

An Open-source Framework for Data-centric Language Agents

Agents is an open-source framework designed to build and train autonomous language agents through a data-centric and learning-oriented architecture. The project introduces a concept known as agent symbolic learning, which treats an agent pipeline similarly to a neural network computational graph. In this framework, each node in the pipeline represents a step in the reasoning or action process, while prompts and tools act as adjustable parameters analogous to neural network weights. During...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
14

GenAI Agents

Implementations for various Generative AI Agent techniques

GenAI Agents is a large, tutorial-driven repository that teaches you how to design, build, and experiment with generative AI agents. It spans a spectrum from simple conversational bots and basic question-answering agents to complex multi-agent systems that coordinate on research, education, business workflows, and creative tasks. The implementations leverage modern frameworks such as LangChain, LangGraph, AutoGen, PydanticAI, CrewAI, and more, showing how each can be wired into realistic...

Downloads: 2 This Week

Last Update: 2026-04-11
See Project
15

code-act

Official Repo for ICML 2024 paper

...The framework also includes training data, models, and evaluation tools designed to study how language models can become more capable autonomous agents.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
16

All-in-RAG

Big Model Application Development Practice 1

All-in-RAG is an open-source educational project designed to teach developers how to build applications using retrieval-augmented generation techniques. The repository provides a structured learning path that covers both theoretical foundations and practical implementation steps for RAG systems. It explains the full development pipeline required to create knowledge-aware AI assistants, including data preparation, document indexing, vector embedding generation, and retrieval strategies. The...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
17

JamAI Base

The collaborative spreadsheet for AI

JamAI Base is an open-source backend platform designed to simplify the development of retrieval-augmented generation systems and AI-driven applications. The platform integrates both a relational database and a vector database into a single embedded architecture, allowing developers to store structured data alongside semantic embeddings. It includes built-in orchestration for large language models, vector search, and reranking pipelines so that AI applications can retrieve relevant...

Downloads: 2 This Week

Last Update: 2026-03-09
See Project
18

Index

The SOTA Open-Source Browser Agent

Index is an open-source browser automation agent designed to autonomously perform complex tasks across websites by transforming web interfaces into programmable APIs. The system enables developers to instruct an AI agent to interact with web pages using natural language rather than traditional automation scripts. Instead of writing detailed browser automation code, users can describe the desired task and allow the agent to interpret the page structure, interact with elements, and complete...

Downloads: 1 This Week

Last Update: 2026-03-06
See Project
19

slime LLM

slime is an LLM post-training framework for RL Scaling

slime is an open-source large language model (LLM) post-training framework developed to support reinforcement learning (RL)-based scaling and high-performance training workflows for advanced LLMs, blending training and rollout modules into an extensible system. It offers a flexible architecture that connects high-throughput training (e.g., via Megatron-LM) with a customizable data generation pipeline, enabling researchers and engineers to iterate on new RL training paradigms effectively. The...

Downloads: 1 This Week

Last Update: 2026-03-29
See Project
20

LLM-Pruner

On the Structural Pruning of Large Language Models

LLM-Pruner is an open-source framework designed to compress large language models through structured pruning techniques while maintaining their general capabilities. Large language models often require enormous computational resources, making them expensive to deploy and inefficient for many practical applications. LLM-Pruner addresses this issue by identifying and removing non-essential components within transformer architectures, such as redundant attention heads or feed-forward...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
21

Swirl

Swirl queries any number of data sources with APIs

...It's intended for use by developers and data scientists who want to solve multi-silo search problems, from enterprise search to new monitoring and alerting solutions that push information to users continuously. Built on the Python/Django/RabbitMQ stack, SWIRL includes connectors to Apache Solr, ChatGPT, Elastic, OpenSearch, PostgreSQL, and Google BigQuery, plus generic HTTP/GET/JSON, with configurations for premium services.

Downloads: 0 This Week

Last Update: 2025-12-11
See Project
22

LlamaDeploy

Deploy your agentic workflows to production

llama_deploy is an open-source framework designed to simplify the deployment and productionization of agent-based AI workflows built with the LlamaIndex ecosystem. The project provides an asynchronous architecture that allows developers to deploy complex multi-agent workflows as scalable microservices. It enables teams to move from experimental prototypes to production systems with minimal changes to existing LlamaIndex code, making it easier to operationalize AI agents. The system supports...

Downloads: 4 This Week

Last Update: 2026-04-06
See Project
23

Ludwig AI

Low-code framework for building custom LLMs, neural networks

Declarative deep learning framework built for scale and efficiency. Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks. A declarative YAML configuration file is all you need to train a state-of-the-art LLM on your data. Support for multi-task and multi-modality learning. Comprehensive config validation detects invalid parameter combinations and prevents runtime failures. Automatic batch size selection, distributed training (DDP, DeepSpeed),...

Downloads: 4 This Week

Last Update: 4 days ago
See Project
24

Xorbits Inference

Replace OpenAI GPT with another LLM in your app

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you can run inference with any open-source language model, speech recognition model, or multimodal model, whether in the cloud, on-premises, or even on your laptop. Xorbits Inference (Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. With Xorbits...

Downloads: 8 This Week

Last Update: 2026-04-12
See Project
25

Slack MCP Server

The most powerful MCP Slack Server with no permission requirements

Slack MCP Server is an open-source server implementation that connects Slack workspaces to AI systems through the Model Context Protocol (MCP). MCP is a standardized protocol that allows large language models and AI agents to securely interact with external tools and data sources such as messaging platforms, databases, or file systems. The slack-mcp-server acts as an intermediary layer that exposes Slack data and messaging functionality to AI clients while enforcing access rules and communication standards. Through this architecture, AI assistants can read message histories, interact with channels, and retrieve contextual information from Slack conversations in order to perform tasks such as automated analysis, collaboration assistance, or contextual code review. ...

Downloads: 3 This Week

Last Update: 2026-03-09
See Project


© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise