speech free download - SourceForge

Showing 19 open source projects for "speech"

View related business solutions

Scientific/Engineering Python Clear Filters & Widen Search

Construction Takeoff Software
Takeoffs From Anywhere

Stratosphere’s cutting edge cloud platform gives estimators unprecedented freedom to complete takeoffs from anywhere with next generation takeoff and markup tools.

Learn More
Inspections+ Mobile forms for Dynamics 365 - Resco.net
Start collecting field data without the hassles of complicated development thanks to resco.Inspections' native integration with Dynamics 365.

Equip your frontline teams with a robust digital solution to simplify data collection and reporting. Handle inspections and audits effortlessly, even in remote locations, and create comprehensive reports on the spot, all integrated with Dynamics 365.

Learn More
1

pyVideoTrans

Translate the video from one language to another and embed dubbing

pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. ...

Downloads: 24 This Week

Last Update: 5 days ago
See Project
2

SPPAS

SPPAS - the automatic annotation and analyses of speech

SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. ...

Downloads: 17 This Week

Last Update: 2026-04-06
See Project
3

Auditory Modeling Toolbox

The auditory modeling toolbox (AMT) is a Matlab/Octave toolbox for the development and application of auditory computational models. Over 50 auditory models implemented in Matlab, Octave, C, C++, and Python can be run from Matlab and Octave, on Windows and Linux. The AMT provides a well-structured in-code documentation, includes auditory data required to run the models. It integrates functionality to reproduce the model predictions. Model implementations can be evaluated in two stages,...

3 Reviews

Downloads: 29 This Week

Last Update: 2026-04-03
See Project
4

Tokenized Text Aligner

Aligns tokens in two versions of a text with differing tokenization.

...It is intended for use in the preparation of annotated linguistic corpora, where differences in tokenization may arise (i) following corrections or modifications to the source text or (ii) through the creation of different layers of annotation (part-of-speech, treebank) requiring different tokenization. In its default implementation, it produces a human-readable CSV table associating tokens in text A with tokens in text B, and can also inject token-level annotation from text B to text A. The Aligner class on which the default implementation is based can be incorporated into more complex workflows.

Downloads: 0 This Week

Last Update: 2026-02-06
See Project
Business password and access manager solution for IT security teams
Simplify Access, Secure Your Business

European businesses use Uniqkey to simplify password management, reclaim IT control and reduce password-based cyber risk. All in one super easy-to-use tool.

Learn More
5

RDRPOSTagger

A Rule-based Part-of-Speech and Morphological Tagging Toolkit

RDRPOSTagger is a robust, easy-to-use and language-independent rule-based toolkit for Part-of-Speech (POS) and morphological tagging. RDRPOSTagger obtains fast performance in both learning and tagging process. RDRPOSTagger also achieves a very competitive accuracy in comparison to the state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai and Vietnamese. ...

2 Reviews

Downloads: 0 This Week

Last Update: 2017-05-24
See Project
6

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods.

Downloads: 0 This Week

Last Update: 2019-08-21
See Project
7

yaafe

Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-25
See Project
8

ACOPOST - a collection of POS taggers

Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-26
See Project
9

Speech Research Tools

Software for speech research. It includes programs and libraries for signal processing, along with general purpose scientific libraries. Most of the code is in Python, with C/C++ supporting code. Also, contains code releases corresponding to publishe

Downloads: 0 This Week

Last Update: 2015-12-13
See Project
Diagnose and Resolve IT Issues in Real Time
Engage your employees and agents more efficiently with ScreenMeet as a seamless extension of your existing IT Service Delivery Platform.

ScreenMeet’s unique combination of video calling, screen share, and remote desktop functionality lets you quickly diagnose hardware and software issues with no frustration.

Learn More
10

pyespeak

Python to eSpeak speech synthesis

ctypes Python module for eSpeak http://espeak.sf.net speech synthesis

Downloads: 0 This Week

Last Update: 2017-10-28
See Project
11

Voice keyboard

Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter (using code: alpha, bravo, ..). Arrow keys, modifiers work. Speak whole words (but whole word accuracy is not good). Attach commands to some word

Downloads: 3 This Week

Last Update: 2015-04-20
See Project
12

SWIPE' pitch extractor

This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.

Downloads: 1 This Week

Last Update: 2013-04-11
See Project
13

PowerTalk

PowerTalk automatically speaks Microsoft PowerPoint presentations. For presenters who find speaking difficult, audiences containing people with visual impairments and fun educational uses. Uses synthesised computer speech provided with Windows

6 Reviews

Downloads: 24 This Week

Last Update: 2014-06-12
See Project
14

A.L.V.I. Bot

A.L.V.I. e' nato per essere un semplice ma modulare Bot, in grado di interagire con l'essere umano attraverso il linguaggio naturale ed eseguire svariati compiti, come leggere ad alta voce Mail, notizie, Feeds. Tutto in Italiano!

Downloads: 0 This Week

Last Update: 2014-12-21
See Project
15

P2TK: Penn Phonetics Toolkit

P2TK, the Penn Phonetics Toolkit, developed in the University of Pennsylvania Linguistics Department Phonetics Lab, is a collection of Python scripts and other tools to aid speech research.

Downloads: 0 This Week

Last Update: 2013-04-22
See Project
16

ASR-Builder

ASR-Builder provides an easy-to-use interface to the HTK toolkit, that allows users to build ASR systems. ASR-Builder provides a platform that performs house-keeping tasks when using HTK and also provides default training/testing/recognition scripts.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
17

Annotation Graph Toolkit

AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.

Downloads: 3 This Week

Last Update: 2013-04-25
See Project
18

ftw. Text Modeller

Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.

Downloads: 0 This Week

Last Update: 2013-03-20
See Project
19

BATS (Blind Audio Tactile Mapping System

The Blind Audio Tactile Mapping System (BATS) attempts to address the lack of spatial information available for visually impaired students.

Downloads: 1 This Week

Last Update: 2014-05-09
See Project

Previous
You're on page 1
Next

Search Results for "speech"

Showing 19 open source projects for "speech"

pyVideoTrans

SPPAS

Auditory Modeling Toolbox

Tokenized Text Aligner

RDRPOSTagger

Distant Speech Recognition

yaafe

ACOPOST - a collection of POS taggers

Speech Research Tools

pyespeak

Voice keyboard

SWIPE' pitch extractor

PowerTalk

A.L.V.I. Bot

P2TK: Penn Phonetics Toolkit

ASR-Builder

Annotation Graph Toolkit

ftw. Text Modeller

BATS (Blind Audio Tactile Mapping System

Search Results for "speech"

Showing 19 open source projects for "speech"

pyVideoTrans

SPPAS

Auditory Modeling Toolbox

Tokenized Text Aligner

RDRPOSTagger

Distant Speech Recognition

yaafe

ACOPOST - a collection of POS taggers

Speech Research Tools

pyespeak

Voice keyboard

SWIPE' pitch extractor

PowerTalk

A.L.V.I. Bot

P2TK: Penn Phonetics Toolkit

ASR-Builder

Annotation Graph Toolkit

ftw. Text Modeller

BATS (Blind Audio Tactile Mapping System

Related Searches

Related Categories