Search Results for "dtmf decoder python"

25 projects for "dtmf decoder python" with 1 filter applied:

Filter: ChromeOS

1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 78 This Week

Last Update: 2025-06-26
See Project
2

Segment Anything

Provides code for running inference with the SegmentAnything Model

...It’s a promptable segmenter: you guide it with points, boxes, or rough masks, and it predicts high-quality object masks consistent with the prompt. The architecture separates a powerful image encoder from a lightweight mask decoder, so the heavy vision work can be computed once and the interactive part stays fast. A bundled automatic mask generator can sweep an image and propose many object masks, which is useful for dataset bootstrapping or bulk annotation. The repository includes ready-to-use weights, Python APIs, and example notebooks demonstrating both interactive and automatic modes. ...

Downloads: 2 This Week

Last Update: 2025-10-06
See Project
3

Kronos

A Foundation Model for the Language of Financial Markets

Kronos is a specialized open-source foundation model designed for analyzing and predicting financial market data using time-series representations of candlestick patterns. It is built as a decoder-only Transformer model trained specifically on K-line data, which captures open, high, low, close, and volume information across multiple global exchanges. The system introduces a novel tokenization approach that converts continuous financial data into discrete tokens, enabling the model to process...

Downloads: 12 This Week

Last Update: 2026-04-10
See Project
4

GLM-OCR

Accurate × Fast × Comprehensive

GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B),...

Downloads: 12 This Week

Last Update: 2026-04-08
See Project
5

IndexTTS2

Industrial-level controllable zero-shot text-to-speech system

IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice...

Downloads: 7 This Week

Last Update: 2025-11-27
See Project
6

ESPnet

End-to-end speech processing toolkit

ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes. This combination allows researchers to leverage modern neural architectures while...

Downloads: 2 This Week

Last Update: 2026-04-07
See Project
7

TimesFM

Pretrained time-series foundation model developed by Google Research

TimesFM is a pretrained time-series foundation model from Google Research built for forecasting tasks, designed to generalize across many domains without requiring extensive per-dataset retraining. It provides a decoder-only model approach to forecasting, aiming for strong performance even in zero-shot or low-data settings where traditional models often struggle. The project includes code and an inference API intended to make it practical to run forecasts programmatically, with options to...

Downloads: 0 This Week

Last Update: 2026-02-19
See Project
8

Multimodal

TorchMultimodal is a PyTorch library

This project, also known as TorchMultimodal, is a PyTorch library for building, training, and experimenting with multimodal, multi-task models at scale. The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference...

Downloads: 0 This Week

Last Update: 2026-01-12
See Project
9

AV1 AVIF

AV1 Image File Format Specification - ISO-BMFF/HEIF derivative

AV1 AVIF is the official specification and reference design for the AV1 Image File Format (AVIF), defining how AV1-encoded bitstreams are packaged into the HEIF container format (based on ISOBMFF) to produce AVIF files. The project outlines the syntax and semantics required for AVIF compliance, including support for multiple image profiles, color depths, chroma subsampling modes, HDR/WCG, alpha channels, animation/image sequences, and various color-space/bit-depth combinations — making AVIF...

Downloads: 0 This Week

Last Update: 2025-12-08
See Project
10

CSM (Conversational Speech Model)

A Conversational Speech Generation Model

The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.

Downloads: 5 This Week

Last Update: 2025-03-19
See Project
11

iced

Blazing fast and correct x86/x64 disassembler, assembler, decoder, etc

iced is a powerful and feature-rich disassembly and assembly library for x86 and x64 architectures, designed to provide accurate decoding, encoding, and formatting of machine instructions. It supports multiple programming languages, including C#, Rust, and Python, making it accessible to a wide range of developers. The library offers both disassembly and assembly capabilities, allowing users to convert between machine code and human-readable instructions in both directions. It includes...

Downloads: 7 This Week

Last Update: 2026-04-06
See Project
12

DiffSinger

Singing Voice Synthesis via Shallow Diffusion Mechanism

DiffSinger is an open-source PyTorch implementation of a diffusion-based acoustic model for singing-voice synthesis (SVS) and also text-to-speech (TTS) in a related variant. The core idea is to view generation of a sung voice (mel-spectrogram) as a diffusion process: starting from noise, the model iteratively “denoises” while being conditioned on a music score (lyrics, pitch, musical timing). This avoids some of the typical problems of prior SVS models — like over-smoothing or unstable GAN...

Downloads: 43 This Week

Last Update: 2025-11-28
See Project
13

Mask2Former

Code release for "Masked-attention Mask Transformer

Mask2Former is a unified segmentation architecture that handles semantic, instance, and panoptic segmentation with one model and one training recipe. Its core idea is to cast segmentation as mask classification: a transformer decoder predicts a set of mask queries, each with an associated class score, eliminating the need for task-specific heads. A pixel decoder fuses multi-scale features and feeds masked attention in the transformer so each query focuses computation on its current spatial...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
14

MAE (Masked Autoencoders)

PyTorch implementation of MAE

MAE (Masked Autoencoders) is a self-supervised learning framework for visual representation learning using masked image modeling. It trains a Vision Transformer (ViT) by randomly masking a high percentage of image patches (typically 75%) and reconstructing the missing content from the remaining visible patches. This forces the model to learn semantic structure and global context without supervision. The encoder processes only the visible patches, while a lightweight decoder reconstructs the...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
15

Denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

Denoiser is a real-time speech enhancement model operating directly on raw waveforms, designed to clean noisy audio while running efficiently on CPU. It uses a causal encoder-decoder architecture with skip connections, optimized with losses defined both in the time domain and frequency domain to better suppress noise while preserving speech. Unlike models that operate on spectrograms alone, this design enables lower latency and coherent waveform output. The implementation includes data...

Downloads: 2 This Week

Last Update: 2025-10-07
See Project
16

ALAE

Adversarial Latent Autoencoders

ALAE (Adversarial Latent Autoencoders) is a deep learning research implementation that combines autoencoders with generative adversarial networks to produce high-quality image synthesis models. The project implements the architecture introduced in the CVPR research paper on Adversarial Latent Autoencoders, which focuses on improving generative modeling by learning latent representations aligned with adversarial training objectives. Unlike traditional GANs that directly generate images from...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
17

OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition

OpenSeq2Seq is a TensorFlow-based toolkit for efficient experimentation with sequence-to-sequence models across speech and NLP tasks. Its core goal is to give researchers a flexible, modular framework for building and training encoder–decoder architectures while fully leveraging distributed and mixed-precision training. The toolkit includes ready-made models for neural machine translation, automatic speech recognition, speech synthesis, language modeling, and additional NLP tasks such as...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
18

seq2seq

A general-purpose encoder-decoder framework for Tensorflow

seq2seq is an early, influential TensorFlow reference implementation for sequence-to-sequence learning with attention, covering tasks like neural machine translation, summarization, and dialogue. It packaged encoders, decoders, attention mechanisms, and beam search into a modular training and inference framework. The codebase showcased best practices for batching, bucketing by sequence length, and handling variable-length sequences efficiently on GPUs. Researchers used it as a baseline to...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
19

toneDetect

toneDetect

A sequential tone decoder geared towards the type of paging used by Fire/EMS. It is written in Python for portability and has been tested on Windows, Linux and Mac. It sends email notifications of pages, with the audio included.

Downloads: 1 This Week

Last Update: 2015-07-30
See Project
20

File based DTMF detector

DTMF detector library and/or application that reads in the specified audio file and reports whether DTMF was detected and/or a list of the detected digits. May have GUI and command line interfaces.

Downloads: 1 This Week

Last Update: 2015-07-22
See Project
21

Tag2Utf cyrillic mp3-tags decoder

Tool for converting the tags of MP3 files from Cyrillic charsets (cp1251, koi8-r) to Unicode. It solves the problem of displaying tags in different charsets in a playlist. If you find a bug or a mistake in this text, mail me at hlamer@tut.by

Downloads: 1 This Week

Last Update: 2015-02-23
See Project
22

pyqrcode

pyqrcode is a Python extension for accessing a Java QR code decoder.

Downloads: 0 This Week

Last Update: 2018-01-22
See Project
23

SRO Protocol Decoder

Network protocol decoder for SRO.

Downloads: 0 This Week

Last Update: 2013-04-24
See Project
24

org.keyphrene

Python wrapper for LibSSH2 and OpenSSL libraries. Includes: yEnc (coder, decoder), hunspell, Par2, crc64

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
25

PyHackerKey

A Simple Encoder/Decoder Application for Hacker Keys (http://www.hackerkey.com) written in Python.

Downloads: 0 This Week

Last Update: 2016-07-26
See Project

© 2026 Slashdot Media. All Rights Reserved.
