Showing 547 open source projects for "python data analysis"

View related business solutions
  • The Secure Workspace for Remote Work Icon
    The Secure Workspace for Remote Work

    Venn isolates and protects work from any personal use on the same computer, whether BYO or company issued.

    Venn is a secure workspace for remote work that isolates and protects work from any personal use on the same computer. Work lives in a secure local enclave that is company controlled, where all data is encrypted and access is managed. Within the enclave – visually indicated by the Blue Border around these applications – business activity is walled off from anything that happens on the personal side. As a result, work and personal uses can now safely coexist on the same computer.
    Learn More
  • Discover the power of eDiscovery for law firms. Icon
    Discover the power of eDiscovery for law firms.

    Streamline your legal processes and ensure compliance with our eDiscovery company.

    DWR eDiscovery allows legal professionals to process, analyze, review, and produce documents that are relevant to litigation and other legal disclosure obligations. Our tools allow easy ingestion and analysis of client and opposing party documents using a comprehensive set of document review features including AI search, keyword search, keyword highlighting, metadata filtering, marking documents, privilege log management, redactions, and a range of analysis tools to help users best understand their document corpus.
    Learn More
  • 1
    JSON2YOLO

    JSON2YOLO

    Convert JSON annotations into YOLO format.

    Explore our state-of-the-art AI architecture to train and deploy your highly accurate AI models like a pro. This directory contains label import/export software developed by Ultralytics LLC, and is freely available for redistribution under the GPL-3.0 license. Ultralytics is a U.S.-based particle physics and AI startup with over 6 years of expertise supporting government, academic, and business clients. We offer a wide range of vision AI services, spanning from simple expert advice up to the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Django REST Pandas

    Django REST Pandas

    Serves up Pandas dataframes via the Django REST Framework

    Django REST Pandas (DRP) provides a simple way to generate and serve pandas DataFrames via the Django REST Framework. The resulting API can serve up CSV (and a number of other formats for consumption by a client-side visualization tool like d3.js. The design philosophy of DRP enforces a strict separation between data and presentation. This keeps the implementation simple, but also has the nice side effect of making it trivial to provide the source data for your visualizations. This...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PDF Merge and Edit

    PDF Merge and Edit

    Python script to merge and edit sensitive PDF files

    ...Pick your files and enter in the data. If there are no problems, a confirmation will pop up.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 14 This Week
    Last Update:
    See Project
  • Professional Streaming and Video Hosting - GDPR Compliant - 3Q Icon
    Professional Streaming and Video Hosting - GDPR Compliant - 3Q

    Secure hosting, scalable streaming, and easy integration for internal and external communications

    3Q offers a multifunctional video platform for hosting, managing and distributing video and audio content on all channels. Live and on-demand.
    Learn More
  • 5

    FOSS license

    FOSS license and sentence token

    We propose a method to mark the comments of license as sentence-token. We use the term sentence-token to refer to a sentence of a known license. A license (both by-inclusion or by-reference) is a sequence of sentence-tokens. Sentence-tokens are generalized using one or more regular expressions. we propose an idea for license identification based on the analysis of each sentence in the license statement of a source code file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    mk1utils

    mk1utils

    mk1utils: tools for the Wersi MK1, EX20, DX10, EX10R, DX5

    mk1utils: tools for the Wersi MK1, EX20, DX10, EX10R, DX5 Included tools: * mk1parse: parse SYX and ROM/RAM files of Wersi MK1, EX20, DX10, EX10R, DX5 * mk1reqall: create SYX and MIDI files for requesting voices from Wersi MK1, EX20, DX10, EX10R * romcvp_dx10_to_mk1: convert CVs+Presets ROM/RAM from Wersi DX10 to MK1 format Special features: Decoding of envelope microcode (modul, special, and standard), rhythms, and sequences. Export of rhythms to MIDI files and waves to WAV files....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Bots open source edi translator

    Bots open source edi translator

    Bots is a complete translator for edi: edifact, x12, xml, tradacoms

    Bots is a complete translator for edi (Electronic Data Interchange). EDI data formats eg: edifact, x12, tradacoms, xml. Mail: http://groups.google.com/group/botsmail Web-site: http://bots.sourceforge.net Wiki: http://bots.readthedocs.io Develop: https://github.com/eppye-bots/bots
    Downloads: 18 This Week
    Last Update:
    See Project
  • 8
    scriptLattes is an open-source system to create academic reports of groups based on curricula vitae of the Brazilian Lattes Database.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 9
    Python ADB

    Python ADB

    Python ADB + Fastboot implementation

    ...Under the hood it speaks the ADB protocol directly and can connect via USB or over TCP, which is useful for lab setups and headless servers. Because it’s Python, you can compose device actions with your favorite testing, scraping, or data-collection libraries in one process. The project also includes utilities for robust connection handling and timeouts so flaky USB links don’t derail long runs. It’s well-suited to CI test farms, large-scale telemetry, and custom device control workflows.
    Downloads: 8 This Week
    Last Update:
    See Project
  • DataImpulse - Ethical Provider of Residential, Mobile, and Data Center IPs Icon
    DataImpulse - Ethical Provider of Residential, Mobile, and Data Center IPs

    For anyone looking for residential proxies, mobile proxies, and datacenter proxies

    DataImpulse (dataimpulse.com) is a proxy provider offering a pool of over 90 million ethically sourced residential, mobile, and data center IP addresses from 195 countries. Pricing for residential proxies starts at $1 per GB with a pay-as-you-go model; there are no subscriptions or traffic expiration dates.
    Learn More
  • 10
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 51 This Week
    Last Update:
    See Project
  • 11
    unfluff

    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    cquery

    cquery

    C/C++ language server supporting multi-million line code base

    C/C++ language server supporting multi-million line code base, powered by libclang. Emacs, Vim, VSCode, and others with language server protocol support. Cross-references, completion, diagnostics, semantic highlighting, and more. cquery is a highly-scalable, low-latency language server for C/C++/Objective-C. It is tested and designed for large codebases like Chromium. cquery provides accurate and fast semantic analysis without interrupting workflow. cquery implements almost the entire...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    AerinSistemas-Noname

    Elasticsearch to Pandas dataframe or CSV

    API and command line utility, written in Python, for querying Elasticsearch exporting result as documents into a CSV file. The search can be done using logical operators or ranges, in combination or alone. The output can be limited to the desired attributes. Also ToT can insert the querying to a Pandas Dataframe or/and save its in a HDF5 container (under development).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    pyspider

    pyspider

    A powerful Spider(Web Crawler) system in Python

    pyspider is a powerful Spider(Web Crawler) system in Python. Components are connected by message queue. Every component, including message queue, is running in their own process/thread, and replaceable. That means, when process is slow, you can have many instances of processor and make full use of multiple CPUs, or deploy to multiple machines. This architecture makes pyspider really fast. benchmarking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    jsondata

    jsondata

    Modular JSON by trees and branches, pointers and patches

    The 'jsondata' package provides for the modular in-memory processing of JSON data by trees, branches, pointers, and patches. The main interface classes are: - JSONData - Core for RFC7159 based data structures. Provides modular data components. - JSONDataSerializer - Core for RFC7159 based data persistence. Provides modular data serialization. - JSONPointer - RFC6901 for addressing by pointer paths. Provides pointer arithmetics. - JSON Relative Pointer -...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    FlexibleParser_Java

    FlexibleParser_Java

    Group of multi-purpose Java-converted-from-C# parsing libraries.

    FlexibleParser (Java) is currently formed by the following independent libraries/parts: - UnitParser. It allows to easily deal with a wide variety of situations involving units of measurement. - NumberParser. It provides a common framework for all the .NET numeric types. AUTHORSHIP & COPYRIGHT I, Alvaro Carballo Garcia (varocarbas), am the sole author of each single bit of this code. All the contents of this repository can be considered public domain. For more information about my...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FlexibleParser

    FlexibleParser

    FlexibleParser is a group of multi-purpose .NET parsing libraries.

    FlexibleParser (C#/.NET) is currently formed by the following independent libraries/parts: - UnitParser. It allows to easily deal with a wide variety of situations involving units of measurement. - NumberParser. It provides a common framework for all the .NET numeric types. - DateParser. It enhances the default .NET date/time support. AUTHORSHIP & COPYRIGHT I, Alvaro Carballo Garcia (varocarbas), am the sole author of each single bit of this code. All the contents of this...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    survol

    RDF-based framework monitoring business systems activity

    A Python agent and a web interface aiming to help the analysis and investigation of a legacy application. A set of machines, processes, databases, programs etc ... all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    **CODE MOVED TO GITHUB: https://github.com/bitextor ** Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    PyAIMLng

    The Next Generation of Python AIML Interpreter

    A Python AIML interpreter with non-compliant extensions. PyAIMLng is an interpreter for AIML (the Artificial Intelligence Markup Language), forked from Cort Stratton's PyAIML. PyAIMLng adds additional features which are not part of the AIML 1.0.1 specification in order to provide the bot master with a rich set of tools from which to build a more believable AIML bot.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    transitfeed

    Tools for reading, validating & writing transit schedule info in GTFS

    transitfeed helps to make public transit information projects more successful for agencies and other interested parties. It provides a library to help you parse, validate, and generate General Transit Feed Spec (GTFS) feed files. It contains several parts, including TransitFeed (Python package for reading, writing, and validating feeds); Feedvalidator (command line tool that checks a GTFS feed for problems); ScheduleViewer (application for exploring a feed on a map) and more; each with their...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    FilteredPush
    Network software for annotation and quality control of distributed data. Supported by NSF: DBI 0646266 and NSF: DBI 0960535. http://wiki.filteredpush.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    IdeoType is a book compiler that converts manuscript (XHTML) to book (PDF) on the fly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    redfish-lab

    Get started with the Redfish RESTful API from the DMTF

    Redfish-lab allows a smooth ramp-up with the Redfish RESTful API on an HPE ProLiant server, including UEFI/BIOS configuration with various scripting languages like PowerShell and Python. Small tutorials/articles are also proposed in the Wiki section.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    WikiSQL

    WikiSQL

    A large annotated semantic parsing corpus for developing NL interfaces

    A large crowd-sourced dataset for developing natural language interfaces for relational databases. WikiSQL is the dataset released along with our work Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. Regarding tokenization and Stanza, when WikiSQL was written 3-years ago, it relied on Stanza, a CoreNLP python wrapper that has since been deprecated. If you'd still like to use the tokenizer, please use the docker image. We do not anticipate switching...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB