Multilingual Automatic Speech Recognition with word-level timestamps
GPU environment management and cluster orchestration
Run serverless GPU workloads with fast cold starts on bare-metal
Deep Learning API and Server in C++14 support for Caffe, PyTorch
A graphical manager for ollama that can manage your LLMs
A real time inference engine for temporal logical specifications