Open Source Python Multimedia Software - Page 8

Python Multimedia Software

Browse free open source Python Multimedia Software and projects below.

  • 1
    YouTube Music Desktop Player

    Turns the YouTube Music site into a desktop application.

    Turns the YouTube Music site into a cross-platform desktop application for Windows and Linux using QtWebEngine.
    Downloads: 35 This Week
  • 2
    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano transcription and works best on solo piano recordings. Using A2M is straightforward: select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder. All processing is done locally on your device: no uploads, no accounts, and no telemetry. The app runs on CPU by default, with optional NVIDIA GPU acceleration for faster conversions. A2M is fully open source and operates only on the files you choose.
    Project links:
    Website: justagwas.com/projects/a2m
    GitHub: github.com/Justagwas/a2m
    Documentation: https://github.com/Justagwas/a2m/wiki
    VirusTotal scan result: https://www.virustotal.com/gui/file/cc2a961baaaac2f8932c2e9ed04f0c27a55309cc03ed0825e44c8af18e263ce6
    Downloads: 61 This Week
  • 3
    GIF-Overlay

    Pure Floating GIF & Image Viewer

    GIF Overlay is a minimalist, high-performance Windows application for viewing GIFs and images in a floating window. Whether you need a reference image for drawing, a desktop decoration, or a lightweight GIF player, GIF Overlay provides a borderless interface that stays exactly where you need it. Website: https://duyxyz.github.io/GIF-Overlay/
    Downloads: 61 This Week
  • 4
    Slim Camera

    Slim Camera - Lightweight RTSP Video Player

    Slim Camera is a lightweight RTSP viewer for IP cameras. On first launch, it prompts for the stream URL (saved for future sessions) and runs in the system tray to avoid taskbar clutter. It remembers window position, size, and camera URL via an INI file for seamless reuse. The interface keeps distractions minimal - just the video stream in an auto-sizing window. Right-click the tray icon to change the camera URL, restart the stream, reset window position, or exit. Press F1 to quickly modify the RTSP address. Optimized for low resource usage, it works reliably even on older hardware, making it perfect for background monitoring. With portable settings (single INI file) and focus on core functionality, Slim Camera delivers no-fuss video streaming for users who value simplicity. Support its free, open-source development with a donation at https://boosty.to/slim-camera/donate to help keep it ad-free and growing!
    Downloads: 60 This Week
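The single-INI-file persistence described for Slim Camera can be sketched with Python's standard `configparser`. This is only an illustration: the file name, section names, and key names below are hypothetical and are not taken from Slim Camera's source.

```python
import configparser
import tempfile
from pathlib import Path


def save_settings(path, url, x, y, width, height):
    """Write the camera URL and window geometry to a single INI file."""
    # interpolation=None keeps '%' characters in URLs literal.
    config = configparser.ConfigParser(interpolation=None)
    config["camera"] = {"url": url}  # hypothetical section and key names
    config["window"] = {
        "x": str(x),
        "y": str(y),
        "width": str(width),
        "height": str(height),
    }
    with open(path, "w", encoding="utf-8") as handle:
        config.write(handle)


def load_settings(path):
    """Return the saved settings, or None when no INI file exists yet."""
    config = configparser.ConfigParser(interpolation=None)
    if not config.read(path, encoding="utf-8"):
        return None  # first launch: the caller would prompt for the URL
    window = config["window"]
    return {
        "url": config["camera"]["url"],
        "x": window.getint("x"),
        "y": window.getint("y"),
        "width": window.getint("width"),
        "height": window.getint("height"),
    }


# Demo in a temporary directory (the real file location is not known).
ini_path = Path(tempfile.mkdtemp()) / "slim_camera.ini"
first_launch = load_settings(ini_path)
save_settings(ini_path, "rtsp://user:p%40ss@192.0.2.10:554/stream", 100, 50, 640, 360)
restored = load_settings(ini_path)
print(first_launch, restored)
```

Keeping everything in one file next to the executable is what makes such settings portable: copying the folder carries the camera URL and window layout with it.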
  • 5
    Automatically geocode pictures from your camera using a GPS track log. Following the closure of Google Code, the only official page (documentation, support, code) is https://github.com/notfrancois/GPicSync
    Downloads: 13 This Week
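Geocoding a photo from a GPS track log comes down to matching the photo's timestamp against the track. The sketch below is a minimal illustration using linear interpolation between the two surrounding track points; it is an assumed approach, not necessarily how GPicSync does it, and the function name and sample coordinates are hypothetical.

```python
from bisect import bisect_left
from datetime import datetime, timezone


def interpolate_position(track, when):
    """Estimate (lat, lon) at time `when` from a time-sorted track.

    `track` is a list of (datetime, lat, lon) tuples. Returns None when
    the photo was taken outside the time span covered by the track.
    """
    if not track:
        return None
    times = [point[0] for point in track]
    if when < times[0] or when > times[-1]:
        return None
    i = bisect_left(times, when)
    t1, lat1, lon1 = track[i]
    if t1 == when:
        return (lat1, lon1)  # exact match with a track point
    t0, lat0, lon0 = track[i - 1]
    fraction = (when - t0) / (t1 - t0)  # timedelta / timedelta gives a float
    # Plain linear interpolation; it ignores the antimeridian wrap-around.
    return (lat0 + fraction * (lat1 - lat0), lon0 + fraction * (lon1 - lon0))


track = [
    (datetime(2024, 5, 1, 12, 0, 0, tzinfo=timezone.utc), 48.000, 2.000),
    (datetime(2024, 5, 1, 12, 0, 10, tzinfo=timezone.utc), 48.001, 2.002),
]
photo_time = datetime(2024, 5, 1, 12, 0, 5, tzinfo=timezone.utc)
position = interpolate_position(track, photo_time)
too_late = interpolate_position(track, datetime(2024, 5, 1, 13, 0, 0, tzinfo=timezone.utc))
print(position, too_late)
```

In practice the camera clock and the GPS clock rarely agree, so a real tool also needs a time offset between the two before matching.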
  • 6
    Airtime

    Open source broadcast automation software for scheduling and playout

    Airtime lets you take total control of your radio station via the web with intelligent archive management, powerful search, an easy playlist builder, a simple scheduling calendar and rock-solid automated playout. Features include Smart Blocks, live assist modes, WAV, FLAC, AAC, MP3 and OGG support, fades, cues, playlists, a programme calendar, Icecast, Shoutcast and SoundCloud integration, DJ and station manager roles, jQuery widgets, Liquidsoap playout, and record-and-rebroadcast functionality. The media archive offers 'watch' folders that let stations synchronise and auto-import files automatically, along with support for multiple directories. Airtime is free to download, runs on Ubuntu and Debian, and users can interact with it through any web browser. Airtime now comes with an 'easy install' package.
    Downloads: 16 This Week
  • 7
    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations with different visual styles, providing flexibility and ease of use for developers and artists interested in exploring dynamic, AI-generated animations.
    Downloads: 32 This Week
  • 8
    easycap-app

    Capture your screen with unprecedented ease and quality.

    Welcome to EasyCap, your ultimate desktop screen recorder and screenshot editor. Designed with simplicity and power in mind, EasyCap is perfect for professionals, creators, and anyone looking to capture their PC activities with ease. Whether you're creating tutorials, recording gameplay, or capturing important moments, EasyCap makes it effortless.
    Downloads: 32 This Week
  • 9
    Comix is a user-friendly, customizable image viewer. It is specifically designed to handle comic books, but also serves as a generic viewer. It reads images in ZIP, RAR or tar archives (also gzip or bzip2 compressed) as well as plain image files.
    Downloads: 12 This Week
  • 10
    3DDFA

    Fast, accurate and stable 3D dense face alignment

    3DDFA_V2 extends 3DDFA and is the official implementation of the paper Towards Fast, Accurate and Stable 3D Dense Face Alignment, accepted at ECCV 2020. Compared to 3DDFA, 3DDFA_V2 achieves better performance and stability. It also uses the fast face detector FaceBoxes instead of Dlib. A simple 3D renderer written in C++ and Cython is included. The repo supports onnxruntime; the latency of regressing 3DMM parameters with the default backbone is about 1.35 ms/image on CPU with a single image as input. Dependencies are listed in requirements.txt and have been tested on macOS and Linux. Windows users may refer to the FAQ for build issues. Note that this repo uses Python 3. The major dependencies are PyTorch, numpy, opencv-python and onnxruntime.
    Downloads: 2 This Week
  • 11
    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an open-source repository, AudioNotes provides developers or power users the opportunity to customize how audio is captured, stored, annotated, and replayed — e.g. adding playback speed control, export to standard formats, or synchronization between notes and audio timeline. It may support simple UI for starting/stopping recordings, writing or editing notes, and navigating through recorded sessions.
    Downloads: 2 This Week
  • 12
    Crunch

    Insane(ly slow but wicked good) PNG image optimization

    Crunch is an image compression tool for lossy PNG image file optimization. Using a combination of selective bit depth, color palette reduction and color type, as well as zopfli DEFLATE compression algorithm encoding that employs the pngquant and zopflipng PNG optimization tools, Crunch is effectively able to optimize and compress images with minimal decrease in image quality. While it may produce file size gains larger than those produced by lossless approaches, the impact on image quality is often imperceptible, and optimized file sizes are still significantly lower than the original.
    Downloads: 2 This Week
  • 13
    FeelUOwn

    Trying to be a robust, user-friendly and hackable music player

    FeelUOwn is a user-friendly and hackable music player.
    Downloads: 2 This Week
  • 14
    Image Super-Resolution (ISR)

    Super-scale your images and run experiments with Residual Dense

    The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry out training and prediction. Scripts are also provided to facilitate training on the cloud with AWS and nvidia-docker with only a few commands. When training your own model, start with only PSNR loss (50+ epochs, depending on the dataset) and only then introduce the GAN and feature losses. This can be controlled by the loss weights argument. The weights used to produce the project's sample images are available directly when creating the model object. ISR is compatible with Python 3.6 and is distributed under the Apache 2.0 license.
    Downloads: 2 This Week
  • 15
    Kornia

    Open Source Differentiable Computer Vision Library

    Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend, both for efficiency and to take advantage of reverse-mode auto-differentiation to define and compute the gradients of complex functions. Inspired by existing packages, the library is composed of a set of packages containing operators that can be inserted within neural networks to train models for image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection, all operating directly on tensors. Kornia aims to fill the gap between classical and deep computer vision by implementing standard and advanced vision algorithms for AI. Its libraries and initiatives are guided by community needs.
    Downloads: 2 This Week
  • 16
    Mopidy

    Mopidy is an extensible music server written in Python

    Mopidy plays music from local disk, Spotify, SoundCloud, TuneIn, and more. You can edit the playlist from any phone, tablet, or computer using a variety of MPD and web clients. Vanilla Mopidy only plays music from files and radio streams. Through extensions, Mopidy can play music from cloud services like Spotify, SoundCloud, and TuneIn. With Mopidy's extension support, you can easily add backends for new music sources. Mopidy is a Python application that runs in a terminal or in the background on Linux computers or Macs that have network connectivity and audio output. Out of the box, Mopidy is an HTTP server. If you install the Mopidy-MPD extension, it becomes an MPD server too. Many additional frontends for controlling Mopidy are available as extensions. You and the people around you can all connect your favorite MPD or web clients to the Mopidy server to search for music and manage the playlist together.
    Downloads: 2 This Week
  • 17
    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 2 This Week
  • 18
    PyTorch Natural Language Processing

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    PyTorch-NLP is a library for Natural Language Processing (NLP) in Python. It’s built with the very latest research in mind, and was designed from day one to support rapid prototyping. PyTorch-NLP comes with pre-trained embeddings, samplers, dataset loaders, metrics, neural network modules and text encoders. It’s open-source software, released under the BSD 3-Clause license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent; the project links to example code for training on the Stanford Natural Language Inference (SNLI) Corpus. Once you've set up your pipeline, you may want to ensure that some functions run deterministically: wrap any code that's random with fork_rng and you'll be good to go. Once you've computed your vocabulary, you may want to make use of pre-trained word vectors to set your embeddings.
    Downloads: 2 This Week
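The idea behind wrapping random code in fork_rng can be illustrated with the standard library alone. The context manager below is a hypothetical stand-in that only handles Python's `random` module; it is not PyTorch-NLP's implementation, and the name `forked_random_state` is invented for this sketch.

```python
import random
from contextlib import contextmanager


@contextmanager
def forked_random_state(seed=None):
    """Run a block with its own random state, then restore the outer one."""
    saved_state = random.getstate()
    try:
        if seed is not None:
            random.seed(seed)  # make the block deterministic
        yield
    finally:
        random.setstate(saved_state)  # outer sequence continues untouched


# What the outer generator would produce next if nothing interfered.
random.seed(0)
expected_next = random.random()

random.seed(0)
with forked_random_state(seed=123):
    first_run = [random.random() for _ in range(3)]
with forked_random_state(seed=123):
    second_run = [random.random() for _ in range(3)]
actual_next = random.random()

print(first_run == second_run, actual_next == expected_next)
```

The two seeded blocks yield identical values, and the code outside them sees the same random sequence it would have seen had the blocks never run.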
  • 19
    Pydub

    Manipulate audio with a simple and easy high level interface

    Manipulate audio with a simple and easy high level interface. You can pass an optional bitrate argument to export using any syntax ffmpeg supports. Any further arguments supported by ffmpeg can be passed as a list in a 'parameters' argument, with switch first, argument second. Note that no validation takes place on these parameters, and you may be limited by what your particular build of ffmpeg/avlib supports. You can open and save WAV files with pure python. For opening and saving non-wav files, like mp3, you'll need ffmpeg or libav. Any operations that combine multiple AudioSegment objects in any way will first ensure that they have the same number of channels, frame rate, sample rate, bit depth, etc.
    Downloads: 2 This Week
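The "switch first, argument second" layout of the 'parameters' list can be sketched as follows. The helper `ffmpeg_parameters` is hypothetical (it is not part of Pydub), and the file names in the comment are placeholders; `-ac` and `-ar` are ffmpeg's audio channel count and sample rate switches.

```python
def ffmpeg_parameters(options):
    """Flatten {switch: value} pairs into a flat list, switch before argument."""
    parameters = []
    for switch, value in options.items():
        parameters.append(switch)
        if value is not None:  # flags without an argument pass None
            parameters.append(str(value))
    return parameters


export_kwargs = {
    "format": "mp3",
    "bitrate": "192k",  # any syntax ffmpeg accepts
    "parameters": ffmpeg_parameters({"-ac": 2, "-ar": 44100}),
}
print(export_kwargs["parameters"])

# With Pydub and ffmpeg installed, the arguments would be used like this:
#   from pydub import AudioSegment
#   AudioSegment.from_wav("input.wav").export("output.mp3", **export_kwargs)
```

Since no validation is performed on these parameters, a typo in a switch only surfaces when ffmpeg itself runs.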
  • 20
    Quod Libet

    Music player and music library manager for Linux, Windows, and macOS

    Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet. Quod Libet is a GTK+-based audio player written in Python, using the Mutagen tagging library. It’s designed around the idea that you know how to organize your music better than we do. It lets you make playlists based on regular expressions (don’t worry, regular searches work too). It lets you display and edit any tags you want in the file, for all the file formats it supports. Unlike some, Quod Libet will scale to libraries with tens of thousands of songs. It also supports most of the features you’d expect from a modern media player.
    Downloads: 2 This Week
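A playlist built from a regular expression amounts to filtering the library by matching a tag against a pattern. The sketch below shows the general idea in plain Python; it does not use Quod Libet's own search syntax or API, and the sample library and function name are hypothetical.

```python
import re

# Hypothetical in-memory library; a real player reads tags from files.
library = [
    {"artist": "Miles Davis", "title": "So What", "genre": "Jazz"},
    {"artist": "John Coltrane", "title": "Giant Steps", "genre": "Jazz"},
    {"artist": "Kraftwerk", "title": "Autobahn", "genre": "Electronic"},
]


def build_playlist(songs, tag, pattern):
    """Return the songs whose `tag` value matches the regular expression."""
    regex = re.compile(pattern, re.IGNORECASE)
    return [song for song in songs if regex.search(song.get(tag, ""))]


playlist = build_playlist(library, "artist", r"^(miles|john)\b")
titles = [song["title"] for song in playlist]
print(titles)
```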
  • 21
    The FreeMoCap Project

    Free Motion Capture for Everyone

    FreeMoCap is an open-source markerless motion capture system that enables users to record human movement using ordinary cameras and convert the footage into usable 3D motion data. The project’s goal is to democratize motion capture by removing the need for expensive suits or proprietary studio hardware, instead relying on computer vision and pose estimation pipelines. It processes synchronized video feeds to reconstruct skeletal motion, which can then be exported for animation, biomechanics research, or creative projects. FreeMoCap includes tools for calibration, recording, processing, and visualization, allowing users to move from raw footage to structured motion data within a single ecosystem. Because it is open and extensible, researchers and developers can adapt the pipeline for specialized motion analysis or integrate it into animation workflows.
    Downloads: 2 This Week
  • 22
    YouTube-8M

    Starter code for working with the YouTube-8M dataset

    youtube-8m is Google’s open source starter code and reference implementation for training and evaluating machine learning models on the YouTube-8M dataset, one of the largest video understanding datasets publicly released. The repository provides a complete pipeline for video-level and frame-level modeling using TensorFlow, including data reading, model training, evaluation, and inference. It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on a large-scale dataset with millions of labeled videos. The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.
    Downloads: 2 This Week
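For readers unfamiliar with the metrics named above, here is a minimal pure-Python sketch of average precision and its per-class mean (mAP). It is a textbook formulation with made-up scores, not the repository's TensorFlow evaluation code; the challenge's gAP additionally pools predictions across videos before computing a single average precision, which this sketch does not reproduce.

```python
def average_precision(scores, labels):
    """Average of precision@k taken at every rank k that holds a positive."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(per_class):
    """Mean of the per-class average precisions."""
    values = [average_precision(scores, labels) for scores, labels in per_class]
    return sum(values) / len(values) if values else 0.0


# Two hypothetical classes: (predicted scores, ground-truth labels).
class_a = ([0.9, 0.8, 0.7, 0.6], [1, 0, 1, 0])
class_b = ([0.2, 0.9], [1, 0])

ap_a = average_precision(*class_a)  # positives at ranks 1 and 3
ap_b = average_precision(*class_b)  # single positive at rank 2
map_score = mean_average_precision([class_a, class_b])
print(ap_a, ap_b, map_score)
```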
  • 23
    youtube-dlG

    A cross platform front-end GUI of the popular youtube-dl

    A cross-platform front-end GUI for the popular youtube-dl media downloader, written in wxPython. Youtube-dl-gui uses youtube-dl in the backend to download files. Youtube-dl provides a list of extractors to work with each particular site. You can change the naming pattern by picking a different filename format under the Options>General tab. You can also use a custom pattern by setting the option to "Custom" and editing the output template field. You can change the maximum number of parallel downloads by editing the "workers_number" option in your settings.json file. You can add extra youtube-dl command line options in the commands box under the Options>Extra tab. You can use your own version of youtube-dl by editing the "youtubedl_path" option in your settings.json file so that it points to your own binary (e.g. /usr/local/bin).
    Downloads: 2 This Week
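Editing the two settings.json options mentioned above can also be scripted. The sketch below assumes the file is a flat JSON object; the option names come from the description, while the helper name, the value types, and the temporary file location are assumptions made for the demo.

```python
import json
import tempfile
from pathlib import Path


def update_settings(path, **changes):
    """Merge `changes` into a JSON settings file and write it back."""
    path = Path(path)
    if path.exists():
        settings = json.loads(path.read_text(encoding="utf-8"))
    else:
        settings = {}
    settings.update(changes)
    path.write_text(json.dumps(settings, indent=4), encoding="utf-8")
    return settings


# Demo on a throwaway file; the real settings.json location is not assumed.
demo_path = Path(tempfile.mkdtemp()) / "settings.json"
demo_path.write_text(json.dumps({"workers_number": 3}), encoding="utf-8")

update_settings(demo_path, workers_number=5, youtubedl_path="/usr/local/bin")
reloaded = json.loads(demo_path.read_text(encoding="utf-8"))
print(reloaded)
```

Other keys already present in the file are preserved, since the helper merges rather than overwrites.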
  • 24
    Audiere is a portable audio library which supports playing MP3, Ogg Vorbis, FLAC, WAV, IT, XM, S3M, and MOD files. You can use it from C, C++, Python, Java, Delphi, and any language that supports XPCOM (JavaScript in Mozilla, for example).
    Downloads: 12 This Week
  • 25
    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staves, polyphony, MIDI playback of notes, chord markings, lyrics, and import/export filters for formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond.
    Downloads: 16 This Week