Run local LLMs like llama, deepseek, kokoro etc. inside your browser
OpenVINO™ Toolkit repository
Replace OpenAI GPT with another LLM in your app
Simplifies the local serving of AI models from any source
Sequence-to-sequence framework, focused on Neural Machine Translation
Guide to deploying deep-learning inference networks
Deploy a ML inference service on a budget in 10 lines of code