Serving

Serving of ML models in Kubeflow

Overview

Model serving overview

KFServing

Model serving using KFServing

Istio Integration (for TF Serving)

Using Istio for TF Serving

Seldon Serving

Model serving using Seldon

NVIDIA TensorRT Inference Server

Model serving using TRT Inference Server

TensorFlow Serving

Serving TensorFlow models

TensorFlow Batch Predict

Batch prediction for TensorFlow models

PyTorch Serving

Instructions for serving a PyTorch model with Seldon