Kubeflow Training Operator is a Kubernetes-native project for fine-tuning and scalable distributed training of machine learning (ML) models created with various ML frameworks such as PyTorch, TensorFlow, XGBoost, MPI, Paddle, and others.

Features

  • TensorFlow Release Only
  • Python SDK for Kubeflow Training Operator
  • Documentation available
  • Examples available
  • TensorFlow API Definition
  • Use Kubernetes workloads to effectively train your large models via Kubernetes Custom Resources APIs
  • Use Training Operator Python SDK

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

Apache License V2.0

Follow Kubeflow Training Operator

Kubeflow Training Operator Web Site

Other Useful Business Software
Award-Winning Medical Office Software Designed for Your Specialty Icon
Award-Winning Medical Office Software Designed for Your Specialty

Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Kubeflow Training Operator!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Go

Related Categories

Go Machine Learning Software

Registered

2024-08-16