32 projects for "speech" with 2 filters applied:

  • Share your screen instantly while on a phone call with CrankWheel for an engaging presentation. Icon
    Share your screen instantly while on a phone call with CrankWheel for an engaging presentation.

    For salespeople and customer service agents who want to compliment their phone calls with visual elements.

    Our 10x simpler screen sharing tool is designed for you if you spend your days on the phone with clients, and need to add a visual presentation to close sales. No more scheduling a follow-up meeting, or teaching them to use a complex tool. Send them a text message or email, and they see your screen in seconds.
    Learn More
  • Empower Your Contact Center with Human-Like AI Conversations Icon
    Empower Your Contact Center with Human-Like AI Conversations

    Deliver faster resolutions, lower costs, and better CX without hiring another agent.

    Enterprise Bot, based in Switzerland, is a pioneer in Conversational AI, Process Automation, and Generative AI. With the trust of esteemed enterprise giants across industries like Generali, SIX, SBB, DHL, and SWICA, Enterprise Bot is revolutionizing both customer and employee experiences. Through its advanced integration with Large Language Models (LLM) such as ChatGPT and Llama 2, and its unique patent-pending DocBrain technology, the company delivers unparalleled personalization, active engagement, and omnichannel solutions across platforms like email, voice, and chat. Furthermore, Enterprise Bot integrates with existing core systems, such as SAP, CRMs, Confluence and more, and with its proprietary middleware, Blitzico, enables the AI to not only respond to queries but also take action to resolve them. This dedication to innovation in four main use case areas, Customer Support, Sales and Marketing, Knowledge Management and Digital Coworker, elevates both CX and employee productivity.
    Learn More
  • 1

    opencore-amr

    Audio codecs extracted from Android Open Source Project

    Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.
    Leader badge
    Downloads: 7,071 This Week
    Last Update:
    See Project
  • 2
    Jason is a fully-fledged interpreter for an extended version of AgentSpeak, a BDI agent-oriented logic programming language, and is implemented in Java. Using JADE a multi-agent system can be distributed over a network effortlessly. This project was moved to https://jason-lang.github.io
    Downloads: 40 This Week
    Last Update:
    See Project
  • 3
    SVoice (Speech Voice Separation)

    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Regulus is a Prolog-based toolkit for building spoken dialogue systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Ango Hub | All-in-one data labeling platform Icon
    Ango Hub | All-in-one data labeling platform

    For AI teams and Computer Vision team in organizations of all size

    AI-Assisted features of the Ango Hub will automate your AI data workflows to improve data labeling efficiency and model RLHF, all while allowing domain experts to focus on providing high-quality data.
    Learn More
  • 5
    FolioReaderKit

    FolioReaderKit

    A Swift ePub reader and parser framework for iOS

    ...Go to your storyboard file, choose or create the view controller that should present the epub reader. In the identity, the inspector set StoryboardFolioReaderContrainer as a class. Media Overlays (Sync text rendering with audio playback). TTS - Text to Speech Support, parse epub cover image, RTL Support. Vertical or/and Horizontal scrolling, share Custom Image Quotes NEW, supports multiple instances at same time, like parallel reading.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Swiss knife Java library
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    A Python library to create sophisticated multilingual IVR applications. NOTICE. The repository is frozen, please find the latest version of the software at https://github.com/sippy/vapp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    High-order HMM in Matlab

    Implementation of duration high-order hidden Markov model in Matlab.

    Implementation of duration high-order hidden Markov model (DHO-HMM) in Matlab with application in speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • DeskTime is a cloud-based time tracking software Icon
    DeskTime is a cloud-based time tracking software

    DeskTime is best for medium to large companies, as well as freelancers who want to boost productivity without overworking.

    DeskTime is a high-performance, automated time tracking and workforce management solution for teams and freelancers. It runs silently in the background, logging computer activity from the moment of boot-up to ensure 100% accurate data without the need for manual timers.
    Learn More
  • 10
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    SourceSquare

    SourceSquare

    The free open source scanning engine

    SourceSquare is a free open source scanning solution, based on Antepedia, the world's largest open source knwoledge base, including more than 1,000,000 open source projects from the main forges. SourceSquare is a free (as in free beer) and free (as in free speech, licensed under GNU Affero Public Licence V3) product, which scans a folder on your hard drive and displays a treemap that shows which part of your source code is open source.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    YABT is a context sensative string manipulation library. YABT currently can be used for Braille translation, but it is possible to put it to other uses such as preparation of text to be spoken by speech synthesisers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Dynamic tree of Java objects encapsulates hard-drive and Jar/Zip files (and their inner files) and Java objects all the same way. Create new ways of communication as executable Jar files, like a paint program that creates/uses paint programs as tools
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A phrase to phoneme code converter for the SpeakJet chip by Magnevation. Speakalator runs on Unix type operating systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    http://www.open-source-soft.narod.ru/VoiceDetector.7z It voice detector class,for detecting human speech, anti noise, anti tones, standard C++, completely cross-platform by recompilation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    WOSH Framework - Wide Open Smart Home
    WOSH is a multi-platform message-oriented middleware written in ANSI C++. Service oriented architecture, designed for network distributed computing. Already working: Audio multimedia, X10, remote control (WinMobile, GTalk) and much more..
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Speech based User Interface Components Library for Java is a project to create Java controls and applications that can be used not only by literate people but also by non-literates. Speech and visual element with minimal text is used to create components
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Virtual News Reader is a computer desktop application that can convert text (idealy for online news) into Virtual Voice and can be saved on computer. The project is using Java, (JSAPI), FreeTTS (Text-To-Speech synthesis), DJProject, Substance, & other.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The jTTS project provides a refactored implementation of the FreeTTS software speech synthesizer. Rather than building upon the flite implementation of a TTS engine, it implements engine capabilities from Festival. It also supports British English.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    BladeWareVXML is a portable VoiceXML 2.1 interpreter that is an enhanced version (performance, usability and integration) of OpenVXI. A commercial version, with documentation, sample code, and support options, is available from the Commetrex Website.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Free Open Source VoiceXML editor programmed in Java (Swing). The VoiceXML document is regularly parsed, a tree view is built and syntax errors are reported in a specific table.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    An open PHP-based framework for the Nabaztag™ (http://www.nabaztag.com/) electronic pet. Due to major changes on the Violet backend, OpenNab can no longer be connected to it but it still can be used as a standalone server to set your bunnies free !
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    espeak-php is a PHP extension that uses espeak functions for speech synthesizing in many languages, such as English, Spanish, Portuguese and others.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Get your database online quickly using configuration not code. Use this secure, scalable and proven enterprise technology to publish any relational data, any custom process on the internet. MySQL, Java, XML, XSL. xHTML GUI, beta voiceXML and WML/WAP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB