java open source free download

Showing 120 open source projects for "java open source"

View related business solutions

Speech Recognition Clear Filters & Widen Search

AestheticsPro Medical Spa Software
Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.

Learn More
Skillfully - The future of skills based hiring
Realistic Workplace Simulations that Show Applicant Skills in Action

Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.

Learn More
1

Vosk Speech Recognition Toolkit

Offline speech recognition API for Android, iOS, Raspberry Pi

Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish. More to come.

Downloads: 100 This Week

Last Update: 2024-04-22
See Project
2

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples....

Downloads: 358 This Week

Last Update: 2026-03-19
See Project
3

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 79 This Week

Last Update: 2025-06-26
See Project
4

OpenVINO

OpenVINO™ Toolkit repository

...This open-source version includes several components: namely Model Optimizer, OpenVINO™ Runtime, Post-Training Optimization Tool, as well as CPU, GPU, MYRIAD, multi device and heterogeneous plugins to accelerate deep learning inferencing on Intel® CPUs and Intel® Processor Graphics. It supports pre-trained models from the Open Model Zoo, along with 100+ open source and public models in popular formats such as TensorFlow, ONNX, PaddlePaddle, MXNet, Caffe, Kaldi.

Downloads: 37 This Week

Last Update: 2026-03-25
See Project
Award-Winning Medical Office Software Designed for Your Specialty
Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.

Learn More
5

Buster

Captcha solver extension for humans

Save time by asking Buster to solve captchas for you. Buster is a Firefox extension which helps you to solve difficult captchas by completing reCAPTCHA audio challenges using speech recognition. Challenges are solved by clicking on the extension button at the bottom of the reCAPTCHA widget. It is not guaranteed that challenges are always solved, the limitations of the technology need to be considered. The continued development of Buster is made possible thanks to the support of awesome...

Downloads: 38 This Week

Last Update: 2026-04-01
See Project
6

WhisperKit

On-device Speech Recognition for Apple Silicon

WhisperKit is a Swift package that integrates OpenAI's popular Whisper speech recognition model with Apple's CoreML framework for efficient, local inference on Apple devices. Whisper has pulled the future forward when fast, free and virtually error-free translation and transcription will be ubiquitous. It inspired numerous developers to improve and deploy it with minimal friction and maximum performance. We founded Argmax in November 2023 to empower developers and enterprises everywhere to...

Downloads: 4 This Week

Last Update: 2026-04-01
See Project
7

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

SpeechBrain is an open-source and all-in-one conversational AI toolkit. It is designed to be simple, extremely flexible, and user-friendly. Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers.

Downloads: 4 This Week

Last Update: 2026-03-30
See Project
8

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...

Downloads: 3 This Week

Last Update: 2026-03-23
See Project
9

Diffgram

Training data (data labeling, annotation, workflow) for all data types

From ingesting data to exploring it, annotating it, and managing workflows. Diffgram is a single application that will improve your data labeling and bring all aspects of training data under a single roof. Diffgram is world’s first truly open source training data platform that focuses on giving its users an unlimited experience. This is aimed to reduce your data labeling bills and increase your Training Data Quality. Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. ...

Downloads: 4 This Week

Last Update: 2024-10-14
See Project
Network Discovery Software | JDisc Discovery
JDisc Discovery supports the IT organizationss of medium-sized businesses and large-scale enterprises.

JDisc Discovery is a comprehensive network inventory and IT asset management solution designed to help organizations gain clear, up-to-date visibility into their IT environment. It automatically scans and maps devices across the network, including servers, workstations, virtual machines, and network hardware, to create a detailed inventory of all connected assets. This includes critical information such as hardware configurations, software installations, patch levels, and relationshipots between devices.

Learn More
10

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project

Kaldi is an open source toolkit for speech recognition research. It provides a powerful framework for building state-of-the-art automatic speech recognition (ASR) systems, with support for deep neural networks, Gaussian mixture models, hidden Markov models, and other advanced techniques. The toolkit is widely used in both academia and industry due to its flexibility, extensibility, and strong community support.

Downloads: 1 This Week

Last Update: 10 hours ago
See Project
11

FireRedASR

Open-source industrial-grade ASR models

FireRedASR is an industrial-grade family of open-source automatic speech recognition models designed to provide high-precision speech-to-text performance across languages including Mandarin, English, and various Chinese dialects, achieving new state-of-the-art benchmarks on public test sets. The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition using attention-based encoder-decoder architectures, giving developers flexibility in balancing performance and resource constraints. ...

Downloads: 0 This Week

Last Update: 2026-02-25
See Project
12

annyang!

Speech recognition for your site

annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
13

Omnilingual ASR

Omnilingual ASR Open-Source Multilingual SpeechRecognition

Omnilingual-ASR is a research codebase exploring automatic speech recognition that generalizes across a very large number of languages using shared modeling and training recipes. It focuses on leveraging self-supervised audio pretraining and scalable fine-tuning so low-resource languages can benefit from high-resource data. The project provides data preparation pipelines, training scripts, decoding utilities, and evaluation tools so researchers can reproduce results and extend to new...

Downloads: 0 This Week

Last Update: 2025-12-12
See Project
14

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.

33 Reviews

Downloads: 69 This Week

Last Update: 2025-01-11
See Project
15

Scribe

Free, open-source, and offline speech-to-text & voice control app.

> Scribe is a free and open-source desktop assistant that brings powerful speech-to-text and voice control capabilities directly to your PC. It allows you to dictate text into any application, create custom voice commands, launch programs, and automate your workflow with text replacements. > Designed with privacy as a top priority, Scribe works completely offline.

Downloads: 77 This Week

Last Update: 2025-12-13
See Project
16

WhisperJAV

A subtitle generator for Japanese Adult Videos.

A subtitle generator for Japanese Adult Videos. Transformer-based ASR architectures like Whisper suffer significant performance degradation when applied to the spontaneous and noisy domain of JAV. This degradation is driven by specific acoustic and temporal characteristics that defy the statistical distributions of standard training data.

1 Review

Downloads: 63 This Week

Last Update: 4 days ago
See Project
17

VoiceCommander Multilingual Offline

Offline multilingual voice control for Windows. Fast, private, local.

Return any reviews or problem to : ducktheapp@gmail.com VoiceCommander Multilingual Offline is a local voice command system for Windows, focused on speed, privacy, and simplicity. It runs entirely on your PC with no cloud services, no external APIs, and no internet connection. All voice recognition and commands are processed locally, ensuring full control of your data. The software allows you to control your computer using voice commands in multiple languages, including launching...

1 Review

Downloads: 34 This Week

Last Update: 2 days ago
See Project
18

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

Note about this image This image contains a system based on Linux MX, which was created to improve accessibility within the Linux environment. The distribution uses the Cinnamon desktop interface, which is configured to be operated using voice commands and outputs. The user interface and the control of your own devices and home automation systems can be customized and extended. The voice control program MiceStTM.py was developed to enable easy adaptation to other languages. However, only...

Downloads: 3 This Week

Last Update: 5 hours ago
See Project
19

Comandi Vocali Offline per Windows

Sistema comandi vocali offline per Windows, veloce e privato .Offline

Comandi Vocali Offline per Windows è un sistema di controllo vocale che funziona interamente in locale sul tuo PC. Permette di controllare il computer con la voce senza connessione internet, senza cloud e senza inviare dati all’esterno. Il sistema è progettato per garantire massima privacy, velocità e semplicità. Caratteristiche principali: - Funziona completamente offline (nessun server, nessun cloud) - Riconoscimento vocale veloce con modelli locali - Controllo di browser,...

Downloads: 1 This Week

Last Update: 2026-04-07
See Project
20

Mice TTM

mice stt tts

Dieses Tool wird speziell für die Barrierefreiheit unter Linux entwickelt. Es ermöglicht das umwandeln/konvertieren/parsen von Texten die aus einer Spracherkennung stammen, in Diktate sowie das Ausführen von Makros. Dies funktioniert ohne Internet, da die Spracherkennung auf dem PC selbst erfolgt. Mausbewegungen auf benannte Wörter und dann entsprechend auswählen oder per Sprachbefehl klicken. Außerdem können Textpassagen z.B. unter Libreoffice Wirter per Sprachbefehl entsprechend...

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
21

AzioSpeech Recognition and Translation

AzioSpeech Recognition and Translation

Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198 A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy. Important Setup Requirements Before using...

Downloads: 0 This Week

Last Update: 2026-02-13
See Project
22

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 1 This Week

Last Update: 2024-04-21
See Project
23

Vosk Desktop

Desktop software for controlling the Vosk Speech Recognition Toolkit

Downloads: 0 This Week

Last Update: 2023-08-10
See Project
24

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
25

Flashlight library

A C++ standalone library for machine learning

Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms with little unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across...

Downloads: 1 This Week

Last Update: 2022-05-27
See Project